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Provided is a method of identifying agonists and antagonists of nuclear receptors that comprises measuring agonist-dependent 
fluorescence resonance energy transfer (FRET) between a fluorescently labeled nuclear receptor or ligand binding domain and fluorescently 
labeled CREB-binding protein (CBP), p300, other nuclear co-activator, or binding portion thereof. The method is simple, rapid, and 
inexpensive. Nuclear receptors and nuclear receptor co-activators labeled with fluorescent reagents for use in the above-described method 
are also provided. 
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TITLE OF THE INVENTION 

ASSAYS FOR NUCLEAR RECEPTOR LIGANDS USING FRET 



5 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional 
Application No. 60/061,385, filed 10/7/97, the contents of which are 
incorporated herein by reference in their entirety. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
15 Not applicable. 

FIELD OF THE INVENTION 

This invention relates to methods of identifying novel 
agonists and antagonists of nuclear receptors utilizing the agonist- 
20 dependent interaction of such receptors with CREB-binding protein 

(CBP) or other nuclear receptor co-activators in which this interaction is 
detected by fluorescence resonance energy transfer. 

BACKGROUND OF THE INVENTION 

25 Nuclear receptors are a superfamily of Hgand-activated 

transcription factors that bind as homodimers or heterodimers to their 
cognate DNA elements in gene promoters. The superfamily, with more 
than 150 members, can be divided into subfamilies {e.g. the steroid, 
retinoid, thyroid hormone, and peroxisome proliferator-activated 

30 [PPAR] subfamilies). Each subfamily may consist of several members 
which are encoded by individual genes (e.g. PPARa, PPARy, and 
PPAR5). In addition, alternative mRNA splicing can result in more 
than one isoform of these genes as in the case of specific PPARs (e.g. 
PPARyl and PPARy2). The nuclear receptor superfamily is involved in 

35 a wide variety of physiological functions in mammalian cells: e.g., 

differentiation, proliferation, and metabolic homeostasis. Dysfunction 
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or altered expression of specific nuclear receptors has been found to be 
involved in disease pathogenesis. 

The PPAR subfamily of nuclear receptors consists of three 
members: PPARa, PPARy, and PPAR5. PPARa is highly expressed in 
5 liver and kidney. Activation of PPARa by peroxisome proliferators 
(including hypolipidimic reagents such as fibrates) or medium and 
long-chain fatty acids is responsible for the induction of acyl-CoA 
oxidase and hydratase-dehydrogenase (enzymes required for 
peroxisomal P-oxidation), as well as cytochrome P450 4A6 (an enzyme 

10 required for fatty acid ©-hydroxylase). Thus, PPARa has an important 
role in the regulation of lipid metabolism and is part of the mechanism 
through which hypolipidimic compounds such as fibrates exert their 
effects. PPARy is predominantly expressed in adipose tissue. Recently, 
a prostaglandin J2 metabolite, 15-Deoxy-D12,14-prostaglandin J2, has 

15 been identified as a potential physiological ligand of PPARy. Both 15- 
Deoxy-D12,14-prostaglandin J2 treatment of preadipocytes or retroviral 
expression of PPARy2 in fibroblasts induced adipocyte differentiation, 
demonstrating the role of PPARy in adipocyte differentiation and lipid 
storage. The demonstration that anti-diabetic and lipid-lowering 

20 insulin sensitizing compounds known as thiazolidinediones are high 
affinity ligands for PPARy suggests a broad therapeutic role for PPARy 
ligands in the treatment of diabetes and disorders associated with 
insulin resistance {e.g. obesity and cardiovascular disease). 

Nuclear receptor proteins contain a central DNA binding 

25 domain (DBD) and a COOH-terminal ligand binding domain (LBD). The 
DBD is composed of two highly conserved zinc fingers that target the 
receptor to specific promoter/enhancer DNA sequences known as 
hormone response elements (HREs). The LBD is about 200-300 amino 
acids in length and is less well conserved than the DBD. There are at 

30 least three functions for the LBD: dimerization, ligand binding, and 
transactivation. The transactivation function can be viewed as a 
molecular switch between a transcriptionally inactive and a 
transcriptionally active state of the receptor. Binding of a ligand which 
is an agonist flips the switch from the inactive state to the active state. 

35 The COOH-terminal portion of the LBD contains an activation function 
domain (AF2) that is required for the switch. 

-2- 
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The ligand-induced nuclear receptor molecular switch is 
mediated through interactions with members of a family of nuclear 
receptor co-activators (e.g., CBP/p300, SRC-l/NcoA-1, TIF2/GRIP- 
l/NcoA-2, and p/CIP). Upon binding of agonist to its cognate receptor 
5 LBD, a conformational change in the receptor protein creates a co- 
activator binding surface and results in recruitment of co-activator(s) to 
the receptor and subsequent transcriptional activation. The binding of 
antagonist ligands to nuclear receptors will not induce the required 
conformational change and prevents recruitment of co-activator and 

10 subsequent induction of transcription. The co-activators CREB-binding 
protein (CBP) and p300 are two closely related proteins that were 
originally discovered by virtue of their ability to interact with the 
transcription factor CREB. These two proteins share extensive amino 
acid sequence homology. CBP can form a bridge between nuclear 

15 receptors and the basic transcriptional machinery (Kamei et al., 1996, 
Cell 85:403-414; Chakravarti et al., 1996, Nature 383:99-103; Hanstein et 
al., 1996, Proc. Natl. Acad. Sci. USA 93:11540-11545; Heery et al., 1997, 
Nature 387:733-736). CBP also contains intrinsic histone 
acetyltransferase activity which could result in local chromatin 

20 rearrangement and further activation of transcription. Ligand- and 
AF2-dependent interaction between certain nuclear receptors and CBP 
has been demonstrated in in vitro pull down assays and far-western 
assays. This interaction is both necessary and sufficient for the 
transcriptional activation that is mediated by these nuclear receptors. 

25 Thus, an AF2 mutant of the estrogen receptor (ER) which abolishes the 
transcriptonal function of the receptor is incapable of interacting with 
CBP. 

The N-termini of CBP and p300 have been shown to interact 
with the ligand-binding domains of some nuclear receptors (Kamei et 
30 al., 1996, Cell 85:403-414, hereinafter "Kamei"). Kamei was able to 

demonstrate direct interaction of CBP and p300 with nuclear receptors 
by several different methods: 

(1) Kamei produced GST fusion proteins of the first 100 
amino acids of the N-terminus of CBP. These fusion proteins were run 
35 out on a polyacrylamide gel, transferred to a membrane, and the 
membrane was exposed to 32p-labeled ligand-binding domains of 

-3- 
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nuclear receptors. In the presence of ligand, a specific binding 
interaction between the CBP and nuclear receptor fragments was 
detected in that the 32p-l a beled ligand-binding domains were observed to 
bind to the bands on the membrane containing the GST-CBP fusion 
5 proteins. 

(2) Kamei also utilized the yeast two-hybrid system. The 
ligand-binding domain of the nuclear receptor fused to the DNA-binding 
domain of the LexA protein was used as bait. The amino terminal 
domain of CBP fused to the gal4 transactivation domain was used as 

10 prey. In the presence of ligand, a specific binding interaction (occurring 
in vivo, i.e., within the yeast) was observed between the CBP and nuclear 
receptor fragments. 

(3) Kamei observed ligand-induced binding between CBP 
and nuclear receptors via a gel-shift assay. This assay is based on the 

15 observation that, in the presence of ligand, nuclear receptors will bind to 
oligonucleotides containing their target recognition sequence. Such 
binding results in the formation of a nuclear receptor-ligand- 
oligonucleotide complex having a higher molecular weight than the 
oligonucleotide alone. This difference in molecular weight is detected 

20 via a shift in position of the 32p-l a beled oligonucleotide when it is nm out 
on a polyacrylamide gel. Kamei found that a fragment of CBP (the N- 
terminal 100 amino acids) was capable of binding to the nuclear 
receptor-ligand-oligonucleotide complex and shifting the complex's 
position on the gel to an even higher molecular weight. 

25 (4) Kamei was able to co-immunoprecipitate CBP using 

antibodies to nuclear receptors in extracts from a variety of cells in the 
presence of ligand. 

(5) By the use of transcriptional activation assays, Kamei 
was able to demonstrate that nuclear receptors and CBP interact in a 

30 functional manner. Such transcriptional activation assays can indicate 
that two proteins are involved in a pathway that results in 
transcriptional activation but these assays do not prove that the 
interaction between the proteins is one of direct binding. 

By the above-described methods, Kamei was able to 

35 demonstrate specific binding interactions between CBP and the retinoic 
acid receptor (RAR), glucocorticoid receptor (GR), thyroid hormone 
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receptor (T3R), and retinoid X receptor (RXR). Kamei also demonstrated 
specific binding between the N-terminus of p300 and RAR. However, 
Kamei did not demonstrate specific binding between CBP, p300, or any 
other nuclear receptor co-activators and PPARs. 
5 What is striking about the methods used by Kamei is their 

extremely laborious and time consuming nature. Such methods 
involve, among other things, the construction of fusion proteins, the 
preparation of 32p-i a beled proteins, the construction of specialized 
expression vectors for the yeast two-hybrid assay and the transcriptional 

10 activation assays, the running of many gels, and the raising of 

antibodies. Most of these assays take days to carry out and preparing the 
reagents needed to carry them out may take weeks. Because of the 
complicated reagents that are involved in these assays and the time 
needed to prepare and run the assays, these assays tend to be costly. 

15 Investigators other than Kamei who have studied the interaction 

between nuclear receptors and CBP have also been forced to rely on such 
cumbersome methods (see, e.g., Chakravarti et aL, 1996, Nature 383:99- 
103; Hanstein et al., 1996, Proc. Natl. Acad. Sci. USA 93:11540-11545; 
Heery et al., 1997, Nature 387:733-736). 

20 Kamei did not use the above-described methods to identify 

novel agonists or antagonists of nuclear receptors. The focus of Kamei 
was not on agonists or antagonists, but rather on the interaction 
between nuclear receptors and CBP. Although modifying the methods 
of Kamei to identify agonists or antagonists might be possible, such 

25 methods would suffer from serious disadvantages. This is because, as 
discussed above, all of the assays employed by Kamei to study the 
interaction of CBP and p300 with nuclear receptors are very laborious, 
slow, and costly. Given the therapeutic importance of steroid hormones 
such as estrogen, Cortisol, progesterone, and other nuclear receptor 

30 agonists such as thyroid hormone and antidiabetic thiazolidinedione 

compounds, the need for improved high-throughput screening assays to 
identify potential pharmaceutical compounds affecting nuclear 
receptors is clear. Historically, therapeutically useful nuclear receptor 
ligand compounds were identified by screening animal models, an 

35 approach which is even more labor intensive and time consuming than 
the methods used by Kamei. Also, approaches such as those used by 

-5- 



WO 99/18124 



PCTYUS98/21049 



Kamei are ill-suited for the identification of antagonists of nuclear 
receptors. It is now widely appreciated that antagonists of nuclear 
receptors can be valuable therapeutic agents. Examples of such 
therapeutically useful antagonists are tamoxifene, raloxifene, and RU- 
5 486. 

What is needed is a high throughput, time and labor- 
saving, non-radioactive, inexpensive, and very reliable assay for the 
identification and characterization of both agonists and antagonists of 
nuclear receptors. Such an assay is provided by the present invention. 

10 

SUMMARY OF THE INVENTION 

The present invention provides novel methods of identifying 
agonists and antagonists of nuclear receptors. The methods take 
advantage of the agonist-dependent binding of nuclear receptors and 

15 CBP, p300, or other nuclear receptor co-activators. In the absence of 
agonist, binding between the nuclear receptor and CBP, p300, or other 
nuclear receptor co-activators does not occur. If agonist is present, 
however, such binding occurs and can be detected by fluorescence 
resonance energy transfer (FRET) between a fluorescently-labeled 

20 nuclear receptor and fluorescently-labeled CBP, p300, or other nuclear 
receptor co-activator. Antagonists can be identified by virtue of their 
ability to prevent or disrupt the agonist-induced interaction of nuclear 
receptors and CBP, p300, or other nuclear receptor co-activators. In 
contrast to prior art methods of identifying agonists and antagonists of 

25 nuclear receptors, the methods of the present invention, are simple, 
rapid, and less costly. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 
above-described methods of identifying agonists and antagonists of 

30 nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a method of fluorescently labelling a 
protein or polypeptide with Europium cryptate (Eu3+K). 
5 Figure 2 illustrates the format for experiments 1 and 2 of 

Table 1. 

Figure 3 illustrates the format for experiment 3 of 

Table 1. 

Figure 4 illustrates the format for experiment 4 of 

10 Table 1. 

Figure 5 shows the results of studies using the methods of 
the present invention with four known PPARy agonists, -o- = AD5075; 
= Pioglitazone; -X-- = Troglitazone; ~0« = BRL49653. 

Figure 6 shows a measurement of the binding constant for 
15 the interaction between hCBP and PPARylLBD. 

Figure 7A shows the amino acid sequence of human CBP 
(SEQ.ID.NO.:l). 

Figure 7B shows the nucleotide sequence of a cDNA 
encoding human CBP (SEQ.ID.NO.:2). The open reading frame is at 
20 positions 76-1290. 

Figure 8A shows the amino acid sequence of human 
PPARa (SEQJD.NO.:3). 

Figure 8B shows the nucleotide sequence of a cDNA 
encoding human PPARa (SEQ.ID.NO.:4). The open reading frame is at 
25 positions 217-1623. 

Figure 9A shows the amino acid sequence of human 
PPARyl (SEQ.ID.NO.:5). 

Figure 9B shows the nucleotide sequence of a cDNA 
encoding human PPARyl (SEQ JD.NO.:6). The open reading frame is at 
30 positions 173-1609. 

Figure 10A shows the amino acid sequence of human 
PPAR8 (SEQ.ID.NO.:7). 

Figure 10B-C shows the nucleotide sequence of a cDNA 
encoding human PPAR5 (SEQ.ID.NO.:8). The open reading frame is at 
35 positions 338-1663. 
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DETAILED DESCRIPTION OF THE INVENTION 
For the purposes of this invention: 

- an "agonist" is a substance that binds to nuclear receptors 
in such a way that a specific binding interaction between the nuclear 

5 receptor and CBP or other nuclear receptor co-activator can occur, 

- an "antagonist" is a substance that is capable of preventing 
or disrupting the agonist-induced specific binding interaction between a 
nuclear receptor and CBP, p300, or another nuclear receptor co- 
activator. 

10 - a "ligand" of a nuclear receptor is an agonist or an 

antagonist of the nuclear receptor. 

- a "specific binding interaction,** "specific binding, " and 
the like, refers to binding between a nuclear receptor and CBP, p300, or 
other nuclear receptor co-activator which results in the occurrence of 

15 fluorescence resonance energy transfer between a fluorescent reagent 
bound to the nuclear receptor and a fluorescent reagent bound to CBP, 
p300, or other nuclear receptor co-activator. 

With respect to CBP, p300, or other nuclear receptor co- 
activators, a "binding portion" is that portion of CBP, p300, or other 

20 nuclear receptor co-activators that is sufficient for specific binding 
interactions with nuclear receptors. 

With respect to nuclear receptors, a "ligand binding 
domain" is that portion of a nuclear receptor that is sufficient to bind an 
agonist or antagonist of the nuclear receptor. 

25 The present invention provides a high throughput, time 

and labor-saving, non-radioactive, inexpensive, and very reliable assay 
for the identification and characterization of both agonists and 
antagonists of nuclear receptors. In a general embodiment, the present 
invention provides methods of identifying agonists and antagonists for 

30 any nuclear receptor for which CBP, p300, or another nuclear receptor 
binding protein is a co-activator. Such agonists and antagonists are 
identified by virtue of their ability to induce or prevent binding between 
the ligand binding domain of a nuclear receptor and CBP, p300, or other 
nuclear receptor co-activator. The interaction between the nuclear 

35 receptor and CBP, p300, or other nuclear receptor co-activator is 

monitored by observing the occurrence of fluorescence resonance energy 

-8- 
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transfer (FRET) between two fluorescent reagents. One fluorescent 
reagent is bound to the nuclear receptor; the other fluorescent reagent is 
bound to CBP, p300, or other nuclear receptor co-activator. The binding 
of fluorescent reagent to nuclear receptor, CBP, p300, or other nuclear 
5 receptor co-activator can be by a covalent linkage or a non-covalent 
linkage. 

The present invention makes use of fluorescence resonance 
energy transfer (FRET). FRET is a process in which energy is 
transferred from an excited donor fluorescent reagent to an acceptor 

10 fluorescent reagent by means of intermolecular long-range dipole-dipole 
coupling. FRET typically occurs over distances of about lOe to lOOfe and 
requires that the emission spectrum of the donor reagent and the 
absorbance spectrum of the acceptor reagent overlap adequately and that 
the quantum yield of the donor and the absorption coefficient of the 

15 acceptor be sufficiently high. In addition, the transition dipoles of the 
donor and acceptor fluorescent reagents must be properly oriented 
relative to one another. For a review of FRET and its applications to 
biological systems, see Clegg, 1995, Current Opinions in Biotechnology 
6:103-110. 

20 The present invention makes use of a nuclear receptor or 

ligand binding domain thereof labeled with a first fluorescent reagent 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof, labeled with a second fluorescent reagent. The second 
fluorescent reagent comprises a fluorophore capable of undergoing 

25 energy transfer by either (a) donating excited state energy to the first 
fluorescent reagent, or (b) accepting excited state energy from the first 
fluorescent reagent. In other words, according to the present invention, 
either the first or the second fluorescent reagents can be the donor or the 
acceptor during FRET. 

30 The first and second fluorescent reagents are 

spectropscopically complementary to each other. This means that their 
spectral characteristics are such that excited state energy transfer can 
occur between them. FRET is highly sensitive to the distance between 
the first and second fluorescent reagents. For example, FRET varies 

35 inversely with the sixth power of the distance between the first and 
second fluorescent reagents. In the absence of agonist, the first 
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fluorescent reagent, bound to the nuclear receptor or ligand binding 
domain thereof, will not be near the second fluorescent reagent, bound to 
CBP, p300, or other nuclear receptor co-activator, or binding portion 
thereof. Thus, no FRET, or very little FRET, will be observed. In the 

5 presence of agonist, however, interaction between the nuclear receptor 
and CBP, p300, or other nuclear receptor co-activator will occur, thus 
bringing close together the first and the second fluorescent reagents, 
allowing FRET to occur and be observed. 

Accordingly, the present invention provides a method of 

10 identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

15 (c) a substance suspected of being an agonist of the 

nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 

20 activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

25 In particular embodiments, the nuclear receptor is selected 

from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic acid receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors" such as LXR, FXR, etc. 

30 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof comprises an AF- 

35 2 site of a nuclear receptor. 
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In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
5 selected from the group consisting of PPARa, PPARyl, PPARy2, and 
PPAR5. In a further embodiment, the ligand binding domain of the 
PPAR contains amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARoc. 

10 In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T3R0CI. In another 
embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 

15 acids 251-595 of human ER. 

In a particluar embodiment, the above-described methods 
utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acid residues 1-453 of 

20 human CBP. 

The conditions under which the methods described above 
are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used buffers as 

25 PBS; a temperature of about 4°C to about 55°C. The presence of 

commonly used non-ionic detergents, e.g., NP-40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

30 Heery et al., 1997, Nature 387:733-736 showed that 

interactions between nuclear receptors and a variety of nuclear receptor 
co-activators are mediated by a short amino acid sequence in the nuclear 
receptor co-activators having the amino acid sequence LXXLL, where L 
is leucine and X represents any amino acid. Accordingly, the present 

35 invention can be practiced with a binding portion of a nuclear receptor 
co-activator, provided that the binding portion contains the amino acid 
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sequence LXXLL. Therefore, the present invention includes a method of 
identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
5 (b) a binding portion of a nuclear receptor co-activator, 

where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
10 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 
15 (d) measuring fluorescence resonance energy transfer 

(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

In a particular embodiment, the nuclear receptor co- 
20 activator is selected from the group consisting of: human RIP- 140, 

human SRC-1, mouse TIF-2, human or mouse CBP, human or mouse 
p300, mouse TIF-1, and human TRIP proteins. 

In a particular embodiment, the nuclear receptor co- 
activator is human RIP- 140 and the binding portion includes a 
25 contiguous stretch of amino acids of human RIP- 140 selected from the 
group consisting of: positions 20-29, 132-139, 184-192, 266-273, 379-387, 
496-506, 712-719, 818-825, 935-944, and 935-942. 

In another embodiment, the nuclear receptor co-activator is 
human SRC-1 and the binding portion includes a contiguous stretch of 
30 amino acids of human SRC-1 selected from the group consisting of: 
positions 45-53, 632-640, 689-696, 748-755, and 1434-1441. 

In another embodiment, the nuclear receptor co-activator is 
mouse TIF-2 and the binding portion includes a contiguous stretch of 
amino acids of mouse TIF-2 selected from the group consisting of: 
35 positions 640-650, 689-699, and 744-754. 
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In another embodiment, the nuclear receptor co-activator is 
human or mouse CBP and the binding portion includes a contiguous 
stretch of amino acids of human or mouse CBP selected from the group 
consisting of: positions 68-78 and 356-366. 
5 In another embodiment, the nuclear receptor co-activator is 

human or mouse p300 and the binding portion includes a contiguous 
stretch of amino acids of human or mouse p300 selected from the group 
consisting of: positions 80-90 and 341-351. 

In another embodiment, the nuclear receptor co-activator is 
10 mouse TIF-1 and the binding portion includes a contiguous stretch of 
amino acids of mouse TIF-1 containing positions 722-732. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP2 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP2 containing positions 23-33. 
15 In another embodiment, the nuclear receptor co-activator is 

human TRIP3 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP3 containing positions 97-107. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP4 and the binding portion includes a contiguous stretch of 
20 amino acids of human TRIP4 containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
human TRIPS and the binding portion includes a contiguous stretch of 
amino acids of human TRIP5 containing positions 26-36. 

In another embodiment, the nuclear receptor co-activator is 
25 human TRIP8 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP8 containing positions 36-46. 

In another embodiment, the nuclear receptor co-activator is 
human TRIP9 and the binding portion includes a contiguous stretch of 
amino acids of human TRIP9 selected from the group consisting of: 
30 positions 73-83, 256-266 and 288-298. 

For amino acid sequences of nuclear receptor co-activators, 
see Yao et al., 1996, Proc. Natl. Acad. Sci. USA 93:10626-10631 (SRC-1); 
0§ate et al., 1995, Science 270:1354-1357 (SRC-1); Cavaillfcs et al., 1995, 
EMBO J. 14:3741-3751 (RIP-140); Voegel et al., 1996, EMBO J. 15:101-108 
35 (TIF-2); Kwok et al., 1994, Nature 370:223-226 (CBP); Arias et al., 1994, 
Nature 370:226-229 (CBP); Eckner et al., 1994, Genes Dev. 8:869-884 
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(p300); Le Douarin et al., 1995, EMBO J. 14:2020-2033 (TIF-1); Lee et al, 
1995, Nature 374:91-94 (TRIP proteins). 

The particular embodiments of the present invention 
described above are all particular embodiments of a more general 
5 method that is also part of the present invention. That general method 
is a method of identifying an agonist of a nuclear receptor that 
comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
10 (b) a polypeptide containing the amino acid sequence 

LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

15 under conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

20 where the occurrence of FRET indicates that the substance 

is an agonist of the nuclear receptor. 

In a particular embodiment, the amino acid sequence 
LXXLL is present in an a helical portion of the polypeptide. In another 
embodiment, the amino acid sequence LXXLL is present in an a helical 
25 portion of the polypeptide and the leucines form a hydrophobic face. 

The present invention provides methods for identifying 
antagonists of a nuclear receptor. Such methods are based on the ability 
of the antagonist to prevent the occurrence of agonist-induced binding 
between a nuclear receptor and CBP, p300, or other nuclear receptor co- 
30 activator, or to disrupt such binding after it has occurred. Thus, the 
present invention provides a method for identifying antagonists of 
nuclear receptors that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
35 (b) CBP, p300, or other nuclear receptor co-activator, or a 

binding portion thereof, labeled with a second fluorescent reagent; 
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(c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
5 binding between the nuclear receptor or ligand binding domain thereof 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 

10 substance is present and measuring FRET between the first and second 
fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

15 In particular embodiments, the nuclear receptor is selected 

from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic acid receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors" such as LXR, FXR, etc. 

20 In a particular embodiment, the nuclear receptor or ligand 

binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is an AF-2 site of 

25 a nuclear receptor. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 

30 selected from the group consisting of PPARcc, PPARy, and PPAR8. In a 
further embodiment, the ligand binding domain of the PPAR contains 
amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARcc. 

35 In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T 3 Rctl. In another 
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embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 
acids 251-595 of human ER. 
5 In a particular embodiment, the above-described methods 

utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acrid residues 1-453 of 
human CBP. 

10 The conditions under which the methods described above 

are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used buffers as 
PBS; a temperature of about 4°C to about 55°C. The presence of 

15 commonly used non-ionic detergents, e.g., NP-40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

In principle, one could measure FRET by monitoring either 

20 (a) a decrease in the emission of the donor fluorescent reagent following 
stimulation at the donor's absorption wavelength and/or (b) an increase 
in the emission of the acceptor reagent following stimulation at the 
donor's absorption wavelength. In practice, FRET is most effectively 
measured by emission ratioing. Emission ratioing monitors the change 

25 in the ratio of emission by the acceptor over emission by the donor. An 
increase in this ratio signifies that energy is being transferred from 
donor to acceptor and thus that FRET is occurring. Emission ratioing 
can be measured by employing a laser-scanning confocal microscope. 
Emission ratioing is preferably done by splitting the emitted light from a 

30 sample with a dichroic mirror and measuring two wavelength bands 
(corresponding to the donor and the acceptor emission wavelengths) 
simultaneously with two detectors. Alternatively, the emitted light can 
be sampled consecutively at each wavelength (by using appropriate 
filters) with a single detector. In any case, these and other methods of 

35 measuring FRET are well known in the art. 



- 16- 



WO 99/18124 



PCT/US98/21049 



Although a variety of donor and acceptor fluorescent 
reagents can be used in the practice of the present invention, preferred 
embodiments of the present invention make use of cryptates of 
fluorescent reagents as donor reagents. Inclusion of a substrate into the 
5 intramolecular cavity of a macropolycyclic ligand results in the 
formation of a cryptate. The macropolycyclic ligand shields the 
substrate from interaction with solvent and other solute molecules. If 
the substrate is a fluororescent reagent, formation of a cryptate may 
result in markedly different spectroscopic characteristics for the reagent 

10 as compared to the spectroscopic characteristics of the free reagent. 

The present invention includes the use of europium (EuHI) 
or terbium (TbUI) cryptates as donor fluorescent reagents. Such EuHI or 
Tblll cryptates, as well as methods for their formation, are well known 
in the art. For example, see Alpha et al., 1987, Angew. Chem. Int. Ed. 

15 Engl. 26:266-267; Mathis, 1995, Clin. Chem. 41:1391-1397. A europium 
cryptate is formed by the inclusion of a europium ion into the 
intramolecular cavity of a macropolycyclic ligand which contains 
bipyridine groups as light absorbers. When europium cryptates are 
present in solution together with fluoride ions, a total shielding of the 

20 europium cryptate fluorescence is occurs. The molecular structure of a 
europium cryptate is shown below. 



NH 2 NH 2 
C 2 H 4 C 2 H 4 
NH NH 
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Europium cryptates can be conjugated to proteins by the use 
of well-known heterobifunctional reagents (see, e.g., International 
Patent Application WO 89/05813; Prat et al., 1991, Anal. Biochem. 
195:283-289; Lopez et al., 1993, Clin. Chem. 39:196-201). 
5 The present invention includes the use of XL665 as the 

acceptor fluorescent reagent. XL665 is a crosslinked derivative of 
allophycocyanin (APC). APC is a porphyrin containing protein which is 
derived from the light harvesting system of algae (Rronick, 1986, M. 
Immunol. Meth. 92:1-13). XL665 has an absorption maximum at «620 

10 nm and an emission maximum at 665 nm. In some embodiments of the 
invention, XL665 is labeled with streptavidin in order to effect the 
binding of the streptavidin-labled XL665 to a biotin-labeled substance, 
e.g. , CBP or the ligand binding domain of a nuclear receptor. 
Streptavidin labeling of XL655 and biotin labeling of CBP, or the ligand 

15 binding domain of a nuclear receptor, can be performed by well known 
methods. 

In a preferred embodiment of the invention, XL665 as the 
acceptor fluorescent reagent is combined with Europium cryptate 
(Eu3+K) as the donor fluorescent reagent. Europium cryptate (Eu3+K) 

20 has a large Stokes shift, absorbing light at 337 nm and emitting at 620 
nm. Thus, the emission maximum of Europium cryptate (Eu3+K) 
overlaps the absorption maximum of XL665. Europium cryptate 
(Eu3+K) has a large temporal shift; the time between absorption and 
emission of a photon is about 1 millisecond. This is advantageous 

25 because most background fluorescence signals in biological samples are 
short-lived. Thus the use of a fluorescent reagent such as europium 
cryptate, with a long fluorescent lifetime, permits time-resolved 
detection resulting in the reduction of background interference. 

The spectral and temporal properties of europium cryptate 

30 (Eu3+K) result in essentially no fluorescence background and thus 

assays using this fluorescent reagent can be carried out in a "mix and 
read" mode, greatly facilitating its use as a high throughput screening 
tool. For the embodiment using Europium cryptate (Eu3+K) and XL665, 
the measuring instrument irradiates the sample at 337 nm and 

35 measures the fluorescence output at two wavelengths, 620 nm (B counts, 
europium fluorescence) and 665 nm (A counts, XL665 fluorescence). 
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The extent of flurorescent resonance energy transfer is measured as the 
ratio between these two values. Typically this ratio is multiplied by 
10,000 to give whole numbers. 

Other FRET donor-acceptor pairs are suitable for the 
5 practice of the present invention. For example, the following donor- 
acceptor pairs can be used: dansyl/fluorescein; fluorescein/rhodamine; 
tryptophan/ aminocoumarin. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 
10 above-described methods of identifying agonists and antagonists of 
nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 

In a particular embodiment, the nuclear receptor or ligand 
15 binding domain thereof is selected from the group consisting of PPARa, 
PPARy, PPAR5, a ligand binding domain of PPARa, PPARy, or PPAR5, 
and amino acid residues 176-478 of human PPARyl and the fluorescent 
reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

20 In a particular embodiment, CBP, p300, or other nuclear 

receptor co-activator is labeled with a fluorescent reagent selected from 
the group consisting of XL665 and Europium cryptate (Eu3+K). 

The following non-limiting examples are presented to better 
25 illustrate the invention. 

EXAMPLE 1 

Cloning, repressiQUi and purification of human CBP axid PPAfi 
prying 

30 To test whether human CBP can interact with PPARs in an 

agonist-dependent manner, we cloned the human cDNA fragments 
encoding the NH2-terminal 1-113 amino acids (hCBPl-113) and 1-453 

amino acids (hCBPl-453) of human CBP by the polymerase chain 
reaction (PCR). The DNA and amino acid sequences of human CBP are 
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disclosed in Borrow et al., 1996, Nature Genet. 14:33-41 and in GenBank, 
accession no. U47741. 

The primers used for hCBPl-113 were: 
5 , -ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3 , 

5 (SEQ.ID.NO.:9) and 

5 , -CACAAAGCTTAGGCCATGTTAGCACTGTTCGG-3 , (SEQ.ID.NO.: 

10). 

These primers were expected to amplify a 0.9 kb DNA fragment. 
The primers for hCBPl-453 were: 
10 5 , -ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3 , 

(SEQ.ID.NO.:9) and 

5 , CTCAGTCGACTTATTGAATTCCACTAGCTGGAGATCC-3 > 
(SEQ.ID.NO.:ll). 

These primers were expected to amplify a 1.5 kb DNA fragment.. 

15 The template for the PCR reaction was a human fetal brain 

cDNA library (Stratagene, Catalogue #IS 937227). Of course, any 
human cDNA library from a tissue expressing CBP could have been 
used. The PCR amplified 0.9 kb and 1.5 kp DNA fragments which were 
digested with restriction endonucleases and ligated into pBluescript II 

20 vector. DNA sequencing analysis confirmed that the amplified 

fragments were identical to the corresponding published nucleic acid 
sequences of human CBP. 

Based on the publicly available sequences for human CBP 
cited above, other primers could be readily identified and prepared by 

25 those skilled in the art in order to amplify and clone other portions of 
cDNA encoding human CBP from appropriate cDNA libraries. Once 
such portions of human CBP are produced, they could be used in the 
methods of the present invention in a manner similar to that described 
herein for hCBPl-113 and hCBPl-453. The amino acid sequence of 

30 human CBP is shown in Figure 7 A; the nucleic acid sequence of the 
cDNA encoding human CBP is shown in Figure 7B. 

To express the polypeptides encoded by the PCR fragments, 
vectors encoding fusion proteins of the polypeptides and glutathione S- 
transferase (GST) were constructed and expressed in E. colL The PCR 

35 fragments were subcloned into the expression vector pGEX (Pharmacia 
Biotech) to generate pGEXhCBPl-113 and pGEXhCBP 1-453. 
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pGEXhCBPl-113 and pGEXhCBPl-453 were transfected into the DH5ct 
strain of E. coli (GIBCO BRL) and the bacteria hosting either 
pGEXhCBPl-113 or pGEXhCBPl-453 were cultured in LB medium 
(GIBCO BRL) to a density of OD600 = 0.7-1.0 and induced for 

5 overexpression of the GST-CBP fusion proteins by addition of IPTG 
(isopropylthio-P-galactoside) to a final concentration of 0.2 mM, The 
IPTG induced cultures were further grown at room temperature for 2-5 
hrs. The cells were harvested by centrifugation for 10 min at 5000g. The 
cell pellet was used for GST-CBP fusion protein purification by following 

10 the procedure from Pharmacia Biotech using Glutathione Sepharose 
beads. hCBPl-113 and hCBPl-453 proteins were generated by cleaving 
the corresponding GST fusion proteins with thrombin. SDS- 
polyacrylamide gel electrophoresis analysis showed that the preparation 
from pGEXhCBPl-113 gave two polypeptide bands, with apparent 

15 molecular weight of 12 kd and 10 kd. The 12 kd band is the expected size 
of hCBPl-113 and the 10 kd band is most likely a premature translational 
termination product. The preparation from pGEXhCBPl-450 gave a 
single band with the expected size, 50 kd. 

cDNAs encoding full-length PPARa and PPARyl were 

20 subcloned into pGEX vectors for the production of GST-PPARa and GST- 
PPARyl fusion proteins in E.coli. PPARyl was cloned from a human fat 
cell cDNA library (see Elbrecht et al., 1996, Biochem. Biophys. Res. 
Comm. 224:431-437). A cDNA encoding the human PPARyl ligand 
binding domain (PPARylLBD; amino acids 176-478 of PPARyl) was 

25 subcloned from a modified pSG5 vector as a Xho I (site located in the N- 
terminus of the LBD)/ Xba I (site located in the pSG5 vector) fragment. 
The Xba I site was blunt-ended with T4 DNA polymerase. The 1.1 kb 
fragment containing the LBD was purified from an agarose gel and 
ligated into pGEX-KG (see Guan & Dixon, 1991, Anal. Biochem. 192:262- 

30 267) that had been digested with Xho I and Hind III (the Hind III site 
had been blunt-ended with T4 DNA polymerase). This construct was 
used for the production of GST-hPPARylLBD and hPPARylLBD (the 
ligand binding domain cleaved free of GST). The overexpression and 
purification of PPARa, PPARyl, and PPARylLBD were as described 

35 above for CBP. 
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The DNA and amino acid sequences of human PPARa are 
disclosed in Schmidt et al., 1992, Mol. Endocrinol. 6:1634-1641 and in 
GenBank, accession no. L07592. See Figure 8A and 8B. 

The DNA and amino acid sequences of human PPARyl ^ e 
disclosed in Greene et al., 1995, Gene Expr. 4:281-299; Qi et al., 1995, Mol. 
Cell. Biol. 15:1817-1825; Elbrecht et al., 1996, Biochem. Biophys. Res. 
Comm. 224:431-437; and in GenBank, accession no. L40904. See Figure 
9A and 9B. Human PPARy2 contains the same amino acid sequence as 
human PPARyl except for an amino terminal addition of 24 amino acids 
(see Elbrecht et al., 1996, Biochem. Biophys. Res. Comm. 224:431-437). 
Thus, the amino acid sequence of the ligand binding domain of human 
PPARy2 is the same as the amino acid sequence of the ligand binding 
domain of human PPARyl, although the numbering of the amino acids 
differs (176-478 for human PPARyl and 200-502 for human PPARy2). 

The DNA and amino acid sequences of human PPAR6 are 
disclosed in Sher et al., 1993, Biochemistry 32:5598-5604 and in GenBank, 
accession no. L02932. See Figure 10A-C. 

EXAMPLE 2 
Interaction between PPA Rs and hCBP fragments 

Experiments were first conducted using hCBPl-113 and 
hPPARylLBD. Purified hPPARylLBD was biotinylated with Sulfo-NHS- 
LC-Biotin (PIERCE) to a biotin:hPPARylLBD ratio of 3:1 according to the 
procedure provided by PIERCE. Purified hCBPl-113 was directly labeled 
with europium cryptate (Eu3+K) by the method illustrated in Figure 1. 
Biotin-labeled hPPARylLBD, Eu3+K-labeled hCBPl-113, and 
streptavidin-labeled XL665 (SA-XL665; from PACKARD) were incubated 
together in the presence or absence of 1 (nM of known PPARy agonist 
(BRL49653 or AD5075). 

Thus, this experimental format made use of the fluorescent 

reagent pair europium cryptate (Eu3+K), which acted as donor, and 

XL665, which acted as acceptor. hCBPl-113 was directly labeled with 

europium cryptate (Eu3+K); hPPARylLBD was indirectly labeled with 

XL665 by means of a biotin-streptavidin link. The emission maxi m um 
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of europium cryptate (Eu3+K) overlaps with the absorption maximum of 
XL665. Therefore, when europium cryptate (Eu3+K) and XL665 are in 
close proximity, and the sample is illuminated with light at 337 nm (the 
absorption maximum of europium cryptate (Eu3+K)), FRET can occur 
5 between europium cryptate (Eu3+K) and XL665. This FRET manifests 
itself as increased emission at 665 nm by XL665. Figure 2 shows a 
schematic of the format used in this experiment (experiment 1 of Table 
1). When agonist is bound to hPPARylLBD, a specific interaction occurs 
between hPPARylLBD and hCBPl-113, thus bringing europium cryptate 

10 (Eu3+K) and XL665 into close enough proximity for FRET to occur. In 
the absence of agonist, no interaction occurs between hPPARylLBD and 
hCBPl-113 and thus europium cryptate (Eu3+K) and XL665 are not 
brought into close proximity and no FRET occurs. When FRET occurs, 
the amount of light given off by the sample at the emission maximum of 

15 XL665 (665 nm) is increased relative to the amount of light given off by 
the sample at the emission maximum of europium cryptate (Eu3+K) 
(620 nm). Therefore, measuring the ratio of emission at 665 nm to 620 
nm in the presence and the absence of a substance suspected of being an 
agonist allows for the determination of whether that substance actually 

20 is an agonist. If the substance is an agonist, an increase in the ratio of 
emission at 665 nm to 620 nm in the presence of the substance will be 
observed. 

Reactions were carried out in microtiter plates. Reaction 
conditions were: appropriate volume (total 250 pi) of the reaction buffer 

25 (either PBS or HEPES, see below, containing 500 mM KF, 0.1% bovine 
serum albumin, BSA) was added to each well, followed by addition of 
ligands (BRL49653 or AD5075 at a final concentration of 1 nM and 0.1% 
dimethylsulfoxide (DMSO) or vehicle control (0.1% DMSO), Eu3+K 
labeled hCBP (100 nM), biotin-hPPARylLBD (100 nM), and streptavidin- 

30 labeled XL665 (100 nM) to appropriate wells. After mixing, 200 |il of 
reaction mixture was transferred to a new well. The plate was either 
directly measured for fluorescence resonance energy transfer (FRET) or 
covered with sealing tape (PACKARD) to avoid evaporation and 
incubated at room temperature for up to 24 hrs before measuring FRET. 

35 The results of this experiment and others described below 

yielded ratio values as follows: 
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Table 1 



Experiment 


Buffer 


Emission ratio 
with AD5075 


Emission ratio 
with vehicle 


1 


PBS 


1134 


1074 


2 


HEPES + 0.05% 
NP40 


967 


617 


3 


HEPES + 0.05% 
NP40 


1078 


536 


4 


HEPES + 0.05% 
CHAPS 


1883 


487 
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Experiment 1 of Table 1 was carried out using PBS (137 mM 
NaCl, 2.7 mM KC1, 4.3 mM Na2HP04, 1-4 mM KH2PO4, pH 7.4). The 
greater emission ratio observed in the presence of AD5075 demonstrated 
that a specific interaction between hCBPl-113 and hPPARylLBD 

5 occurred in the presence of the agonist AD5075. Although it was clear 
that FRET was occurring, the signal-noise ratio was small. In 
experiment 2 of Table 1, HEPES buffer (N-2-hy<lroxyethylpiperazine-N , - 
2-ethane sulfonic acid, 100 mM, pH 7.0) containing 0.05% NP40 (Nonidet 
P-40) was used instead of PBS and an improved signal-noise ratio was 

10 obtained. 

In order to get an even better signal-noise ratio, the above- 
described format was modified slightly for experiment 3. In experiment 
3, SA-XL665 (500 nM), biotin-labeled hPPARylLBD (100 nM), GST- 
hCBPl-113, and Eu3+K labeled anti-GST antibody (2.5 pi) were incubated 

15 in the presence or absence of AD5075 (1 pM) in HEPES buffer containing 
0.05% NP40. A two-fold signal- noise ratio was obtained. Figure 3 shows 
a schematic of the format used in experiment 3. 

The anti-GST antibody was a goat antibody to GST from 
Pharmacia (catalogue number 27-4577-01) that was labeled with Eu3+K 

20 according to the procedure summarized below. 

- Make up @ 10 
mg/mL in H20. 
Need 42.2 pg (4.2 
pL, 96.6 nmol) for 
49.0 pg Eu3+ 
Reagent 

Na0 3 S-< v ^ « W O 
O FW = 436.4 



NH, 



-Resuspended @ -FW = 1465 2.9 Equiv SULFO-SMCC 

2.5 mg/mL in 10% Use 49.0 pg 20 mM Pi buffer, 10% DMF 

DMF/PBS (19.6 uL, 33.4 nmol) RT, 30 minutes 
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5 equiv. Eu3+ 
complex 



Anti-GST Antibody, 
Cat #27-4577-01 




1) Add Eu reagent 

2) [Pr] 4.2 mg/mL. 
Onight at 4°C. 

Lower pH: Add 12 pL of 
1M NaPi, pH 7.0. 
pH drops to 7.18. 

350 pM TCEP 

(35 mM stock is 

10.0 mg/mL- PBS, pH 

7.0), 2.4 pL, 

15 min rt then 15 min @ 

4°C 



Anti-GST Antibody, 
Cat #27-4577-01 



NH 5 



From Pharmacia, 5.0 mg/mL, 
FW = 150 kD Use 200 pL (1 mg, 
6.66 nmol) exchange into 10 mM 
Borate, 350 mM NaCl, 10% Gly, 
pH 8.5 with BioSpin-30 



5.0 Equiv SPDP, 
RT, 5 hours 



FW=312, Dissolve 
@ 1.00 mg/mL in EtOH. 
Add 10.4 pL (5 equiv., 
10.4 pg, 33.4 nmol) to 
protein. 



To further improve the signal to noise ratio, a series of 
experiments were conducted. Experiment 4 of Table 1 exemplifies 
results obtained from those efforts. cDNA encoding a longer fragment of 
hCBP was cloned and expressed to get hCBPl-453. hCBPl-453 was 
biotinylated. Biotin-labeled hCBPl-453 (25 nM), SA-XL665 (100 nM), 
GST-hPPARylLBD (1 nM), and Eu3+K-labeled anti-GST antibody (2 nM) 
were mixed together in the presence or absence of 1 |iM AD5075. The 
detergent was changed from 0.05% NP40 to 0.5% CHAPS (3-{[3- 
cholamidopropyl]dimethyl-ammoniol}-l-propanesulfonate). A three- to 
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four-fold signal-noise ratio was obtained. Figure 4 shows the strategy 
used for experiment 4 and similar experiments. 

The correlation between results from the above-described 
assays and previously reported results from in vitro binding and 
transcriptional activation assays of selected antidiabetic insulin 
sensitizers that are known to be PPARy agonists (Elbrecht et al., 1996, 
Biochem Biophys Res Comm 224:431-437) was analyzed by titrating those 
known PPARy agonists in the assays described above and comparing 
EC50s so obtained with previously described values for potency in 
binding or transcriptional activation assays for the known agonists. The 
results are shown in Figure 5. From Figure 5, the following EC50s can 
be derived: 

AD5075 =8nM 

BRL49653 = 53 nM 

Troglitazone = 646 nM 

Pioglitazone = 890 nM. 
These EC50S generated in the above-described assays are in close 
agreement with those generated by in vitro binding and transcriptional 
activation studies (Elbrecht et al., 1996, Biochem Biophys Res Comm 
224:431-437). 

The above-described assay can also be used to characterize 
the interaction between nuclear receptors with co-activators as, e,g. , by 
determining the binding constant for that interaction. Figure 6 shows 
an example of such an application. Saturating amounts of PPARy 
agonist (10 jaM BRL49653) were used. Increasing concentrations of non- 
biotinylated hCBPl-453 were used to titrate away biotin-hCBP- 
PPARylLBD complex and decrease the fluorescence energy transfer. A 
Kd of 300 nM for the interaction between hCBPl-453 and PPARylLBD 
can be derived from the results illustrated in Figure 6 and this Kd (300 
nM) is a measurement ofthe affinity between CBP and PPARy. 

The present invention is not to be limited in scope by the 
specific embodiments described herein. Indeed, various modifications 
of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description. Such 
modifications are intended to fall within the scope ofthe appended 
claims. 
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Various publications are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 
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WHAT IS CLAIMED: 

1. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
5 labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

10 tinder conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 
activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
15 (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

2. The method of claim 1 where the nuclear receptor or 
20 ligand binding domain thereof is selected from the group consisting of 

steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 3. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

30 4. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

5. The method of claim 1 where the nuclear receptor or 
35 ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

6. The method of claim 1 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPAR72, and PPAR8. 

7. The method of claim 1 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 

RARoc, amino acids 122-410 of rat T3R0CI, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 

8. The method of claim 1 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length human CBP, full-length mouse 
CBP, amino acid residues 1-113 of human CBP, and amino acid residues 
1-453 of human CBP. 

20 9. The method of claim 1 where the first fluorescent 

reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

10. The method of claim 1 where the second fluorescent 
25 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 

11. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

30 (a) a nuclear receptor or ligand binding domain thereof 

labeled with a first fluorescent reagent; 

(b) a binding portion of a nuclear receptor co-activator, 
where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 

35 reagent; and 
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(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
5 binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
10 is an agonist of the nuclear receptor. 



12. The method of claim 11 where the binding portion of a 
nuclear receptor co-activator is selected from the group consisting of 
human RIP-140, human SRC-1, mouse TIF-2, human or mouse CBP, 
15 human or mouse p300, mouse TIF-1, and human TRIP proteins. 



13. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
20 labeled with a first fluorescent reagent; 

(b) a polypeptide containing the amino acid sequence 
LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
25 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescent resonance energy transfer 
30 (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 



14. A method for identifying an antagonist of a nuclear 
35 receptor that comprises providing: 
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(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; 

5 (c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

tinder conditions such that, in the absence of the substance, 
binding between the nuclear receptor or ligand binding domain thereof 
10 and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 
substance is present and measuring FRET between the first and second 

15 fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

20 15. The method of claim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of 
steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 

16. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

30 

17. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

35 18. The method of claim 14 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

19. The method of claim 14 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPARy2, and PPAR8. 

20. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 

RARa, amino acids 122-410 of ratT3Ral, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 

21. The method of claim 14 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length CBP, amino acid residues 1-113 
of human CBP, and amino acid residues 1-453 of human CBP. 

22. The method of claim 14 where the first fluorescent 
20 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 

23. The method of claim 14 where the second fluorescent 
reagent is selected from the group consisting of XL665 and Europium 

25 cryptate (Eu3+K). 

24. A nuclear receptor or ligand binding domain thereof 
labeled with a fluorescent reagent. 

30 25. The nuclear receptor or ligand binding domain 

thereof of claim 24 where the nuclear receptor or ligand binding domain 
thereof is selected from the group consisting of PPARa, PPARyl, 
PPAR72, PPAR5, a ligand binding domain of PPARa, PPARyl, PPARy2, 
or PPAR8, and amino acid residues 176-478 of human PPARyl and the 

35 fluorescent reagent is selected from the group consisting of XL665 and 
Europium cryptate (Eu3+K). 
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26. CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a fluorescent reagent. 

5 27. The CBP, p300, or other nuclear receptor co-activator, 

or a binding portion thereof, of claim 26 where the fluorescent reagent is 
selected from the group consisting of XL665 and Europium cryptate 
(Eu3+KX 
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1 MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDELIPNGGELGLLNSGNLV 
6 1 PDAASKHKQLSELLRGGSGSS INPGI GN VSASSP VQQGLGGQAQGQPNSANMASLSAMGK 
121 SPLSQGDSSAPSLPKQAASTSGPTPAASQALNPQAQKQVGLATSSPATSQTGPGICMNAN 
181 FNQTHPGLLNSNSGHSLINQASQGQAQVMNGSLGAAGRGRGAGMPYPTPAMQGASSSVLA 
241 ETLTQVSPQMTGHAGLNTAQAGGMAKMGITGNTSPFGQPFSQAGGQPMGATGVNPQLASK 
301 QSMVNSLPTFPTDIKNTSVTNVPNMSQMQTSVGIVPTQAIATGPTADPEKRKLIQQQLVL 
361 LLHAHKCQRREQANGEVRACSLPHCRTMKNVLNHMTHCQAGKACQ 

FIG.7A 



1 cgagccccga cccccgtccg ggccctcgcc 

61 cccgagcagg tgaaaatggc tgagaacttg 

121 aaactcagct cgcccggttt ctcggcgaat 

181 ttggaaaatg atcttcctga tgagctgata 

241 agtgggaacc ttgttccaga tgctgcttcc 

301 ggaggcagcg gctctagtat caacccagga 

361 cagcagggcc tgggtggcca ggctcaaggg 

421 agtgccatgg gcaagagccc tctgagccag 

481 caggcagcca gcacctctgg gcccaccccc 

541 caaaagcaag tggggctggc gactagcagc 

601 tgcatgaatg ctaactttaa ccagacccac 

661 agcttaatta atcaggcttc acaagggcag 

721 gctggcagag gaaggggagc tggaatgccg 

781 agcagcgtgc tggctgagac cctaacgcag 

841 ctgaacaccg cacaggcagg aggcatggcc 

901 tttggacagc cctttagtca agctggaggg 

961 cagttagcca gcaaacagag catggtcaac 

1021 aatacttcag tcaccaacgt gccaaatatg 

1081 cccacacaag caattgcaac aggccccact 

1141 cagcagctgg ttctactgct tcatgctcat 

1201 gaggttcggg cctgctcgct cccgcattgt 

1261 acgcattgtc aggctgggaa agcctgccaa 

FIG.7B 



ggccgcgccg cccgtgcccg gggctgtttt 
ctggacggac cgcccaaccc caaaagagcc 
gacagcacag attttggatc attgtttgac 
cccaatggag gagaattagg ccttttaaac 
aaacataaac aactgtcgga gcttctacga 
ataggaaatg tgagcgccag cagccccgtg 
cagccgaaca gtgctaacat ggccagcctc 
ggagattctt cagcccccag cctgcctaaa 
gctgcctccc aagcactgaa tccgcaagca 
cctgccacgt cacagactgg acctggtatc 
ccaggcctcc tcaatagtaa ctctggccat 
gcgcaagtca tgaatggatc tcttggggct 
taccctactc cagccatgca gggcgcctcg 
gtttccccgc aaatgactgg tcacgcggga 
aagatgggaa taactgggaa cacaagtcca 
cagccaatgg gagccactgg agtgaacccc 
agtttgccca ccttccctac agatatcaag 
tctcagatgc aaacatcagt gggaattgta 
gcagatcctg aaaaacgcaa actgatacag 
aagtgtcaga gacgagagca agcaaacgga 
cgaaccatga aaaacgtttt gaatcacatg 
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1 MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSFGFTEYQYLGSC 
61 PGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE 
121 GCKGFFRRTIRLKLVYDKCDRSCK1QKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSE 
181 KAKLKAEILTCEHOIEDSETADLKSLAKRIYEAYLKNFNMNKVKARVILSGKASNNPPFV 
241 IHDMETLCMAEKTLVAKLVANGIQNKEVEVRIFHCCQCTSVETVTELTEFAKAIPAFANL 
301 DLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSURKPFCDIMEPKFD 
361 FAMKFNALELDDSDISLFVAAIICCGDRPGLLNVGHIEKMQEGIVHVURLHLQSNHPDDI 
421 FLPKLLQKMADLRQLVTEHAQLVQIIKKTESDAALHPLLQEIYRDMY 

FIG.8A 

1 ggcccaggct gaagctcagg gccctgtctg ctctgtggac tcaacagttt gtggcaagac 
61 aagctcagaa ctgagaagct gtcaccacag ttctggaggc tgggaagttc aagatcaaag 
121 tgccagcaga ttcagtgtca tgtgaggacg tgcttcctgc ttcatagata agagtagctt 
181 ggagctcggc ggcacaacca gcaccatctg gtcgcgatgg tggacacgga aagcccactc 
241 tgccccctct ccccactcga ggccggcgat ctagagagcc cgttatctga agagttcctg 
301 caagaaatgg gaaacatcca agagatttcg caatccatcg gcgaggatag ttctggaagc 
361 tttggcttta cggaatacca gtatttagga agctgtcctg gctcagatgg ctcggtcatc 
421 acggacacgc tttcaccagc ttcgagcccc tcctcggtga cttatcctgt ggtccccggc 
481 agcgtggacg agtctcccag tggagcattg aacatcgaat gtagaatctg cggggacaag 
541 gcctcaggct atcattacgg agtccacgcg tgtgaaggct gcaagggctt ctttcggcga 
601 acgattcgac tcaagctggt gtatgacaag tgcgaccgca gctgcaagat ccagaaaaag 
661 aacagaaaca aatgccagta ttgtcgattt cacaagtgcc tttctgtcgg gatgtcacac 
721 aacgcgattc gttttggacg aatgccaaga tctgagaaag caaaactgaa agcagaaatt 
781 cttacctgtg aacatgacat agaagattct gaaactgcag atctcaaatc tctggccaag 
841 agaatctacg aggcctactt gaagaacttc aacatgaaca aggtcaaagc ccgggtcatc 
901 ctctcaggaa aggccagtaa caatccacct tttgtcatac atgatatgga gacactgtgt 
961 atggctgaga agacgctggt ggccaagctg gtggccaatg gcatccagaa caaggaggtg 
1021 gaggtccgca tctttcactg ctgccagtgc acgtcagtgg agaccgtcac ggagctcacg 
1081 gaattcgcca aggccatccc agcgttcgca aacttggacc tgaacgatca agtgacattg 
1141 ctaaaatacg gagtttatga ggccatattc gccatgctgt cttctgtgat gaacaaagac 
1201 gggatgctgg tagcgtatgg aaatgggttt ataactcgtg aattcctaaa aagcctaagg 
1261 aaaccgttct gtgatatcat ggaacccaag tttgattttg ccatgaagtt caatgcactg 
1321 gaactggatg acagtgatat ctcccttttt gtggctgcta tcatttgctg tggagatcgt 
1381 cctggccttc taaacgtagg acacattgaa aaaatgcagg agggtattgt acatgtgctc 
1441 agactccacc tgcagagcaa ccacccggac gatatctttc tcttcccaaa acttcttcaa 
1501 aaaatggcag acctccggca gctggtgacg gagcatgcgc agctggtgca gatcatcaag 
1561 aagacggagt cggatgctgc gctgcacccg ctactgcagg agatctacag ggacatgtac 
1621 tgagttcctt cagatcagcc acaccttttc caggagttct gaagctgaca gcactacaaa 
1681 ggagacgggg gagcagcacg attttgcaca aatatccacc actttaacct tagagcttgg 
1741 acagtctgag ctgtaggtaa ccggcatatt attccatatc tttgttttaa ccagtacttc 
1801 taagagcata gaactcaaat gctgggggag gtggctaatc tcaggactgg gaag 

FIG.8B 



WO 99/18124 



PCT/US98/21049 



8/11 

1 MTMVDTEIAFWPTNFGISSVDLSVMEDHSHSFDIKPFTTVDFSSISTPHYEDIPFTRTDP 
61 VVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEPSNSLMAIECRVCGDKASG 
121 FH YGVHACEGCKGFFRRTI RLKL I YDRCDLNCRI HKKSRNKCQYCRFQKCLAVGMSHNAI 
181 RFGRIAQAEKEKLLAEISSDIDQLNPESADLRQALAKHLYDSYIKSFPLTKAKARAILTG 
241 KTTDKSPFVIYDMNSLMMGEDKI KFKH ITPLQEQSKE VAI R I FQGCQFRS VEAVQE ITE Y 
301 AKS I PGFVNLDLNDQVTLLKYGVHE I I YTMLASLMNKDGVL I SEGQGFMTREFLKSLRKP 
361 FGDFMEPKFEFAVKFNALELDDSDLAIFIAVIILSGDRPGLLNVKPIEDIQDNLLQALEL 
421 QLKLNHPESSQUAKLLQKMTDLRQIVTEHVQLLQVIKKTETDMSLHPLLQEIYKDLY 

FIG.9A 



1 ccgaccttac cccaggcggc cttgacgttg gtcttgtcgg caggagacag caccatggtg 

61 ggttctctct gagtctggga attcccgagc ccgagccgca gccgccgcct ggggggcttg 

121 ggtcggcctc gaggacaccg gagaggggcg ccacgccgcc gtggccgcag aaatgaccat 

181 ggttgacaca gagatcgcat tctggcccac caactttggg atcagctccg tggatctctc 

241 cgtaatggaa gaccactccc actcctttga tatcaagccc ttcactactg ttgacttctc 

301 cagcatttct actccacatt acgaagacat tccattcaca agaacagatc cagtggttgc 

361 agattacaag tatgacctga aacttcaaga gtaccaaagt gcaatcaaag tggagcctgc 

421 atctccacct tattattctg agaagactca gctctacaat aagcctcatg aagagccttc 

481 caactccctc atggcaattg aatgtcgtgt ctgtggagat aaagcttctg gatttcacta 

541 tggagttcat gcttgtgaag gatgcaaggg tttcttccgg agaacaatca gattgaagct 

601 tatctatgac agatgtgatc ttaactgtcg gatccacaaa aaaagtagaa ataaatgtca 

661 gtactgtcgg tttcagaaat gccttgcagt ggggatgtct cataatgcca tcaggtttgg 

721 gcggatcgca caggccgaga aggagaagct gttggcggag atctccagtg atatcgacca 

781 gctgaatcca gagtccgctg acctccgtca ggccctggca aaacatttgt atgactcata 

841 cataaagtcc ttcccgctga ccaaagcaaa ggcgagggcg atcttgacag gaaagacaac 

901 agacaaatca ccattcgtta tctatgacat gaattcctta atgatgggag aagataaaat 

961 caagttcaaa cacatcaccc ccctgcagga gcagagcaaa gaggtggcca tccgcatctt 

1021 tcagggctgc cagtttcgct ccgtggaggc tgtgcaggag atcacagagt atgccaaaag 

1081 cattcctggt tttgtaaatc ttgacttgaa cgaccaagta actctcctca aatatggagt 

1141 ccacgagatc atttacacaa tgctggcctc cttgatgaat aaagatgggg ttctcatatc 

1201 cgagggccaa ggcttcatga caagggagtt tctaaagagc ctgcgaaagc cttttggtga 

1261 ctttatggag cccaagtttg agtttgctgt gaagttcaat gcactggaat tagatgacag 

1321 cgacttggca atatttattg ctgtcattat tctcagtgga gaccgcccag gtttgctgaa 

1381 tgtgaagccc attgaagaca ttcaagacaa cctgctacaa gccctggagc tccagctgaa 

1441 gctgaaccac cctgagtcct cacagctgtt tgccaagctg ctccagaaaa tgacagacct 

1501 cagacagatt gtcacggaac acgtgcagct actgcaggtg atcaagaaga cggagacaga 

1561 catgagtctt cacccgctcc tgcaggagat ctacaaggac ttgtactagc agagagtcct 

1621 gagccactgc caacatttcc cttcttccag ttgcactatt ctgagggaaa atctgaccat 

1681 aagaaattta ctgtgaaaaa gcgttttaaa aagaaaaggg tttagaatat gatctatttt 

1741 atgcatattg tttataaaga cacatttaca atttactttt aatattaaaa attaccatat 

1801 tatgaaattg c 

FIG.9B 
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1 MEQPQEEAPEVREEEEKEEVAEAEGAPELNGGPQHALPSSSYTDLSRSSSPPSLLDQLQM 

61 GCPGASCGSLNMECRVCGDKASGFHYGVHACEGCKGFFRRTIRMKLEYEKCERSCKIQKK 

121 NRNKCQYCRFQKCLALGMSHNAIRFGRMPEAEKRKLVAGLTANEGSQYNPOVADLKAFSK 

181 HIYNAYLKNFNMTKKKARSILTGKASHTAPFVIHOIETLWQAEKGLVWKQLVNGLPPYKE 

241 ISVHVFYRCQCTTVETVRELTEFAK5IPSFSSLFLNDQVTLLKYGVHEAIFAMLASIVNK 

301 DGLLVANGSGFVTREFLRSLRKPFSDI IEPKFEFAVKFNALELDDSDLALFIAAI ILCGD 

361 RPGLMNVPRVEAIQDTILRALEFHLQANHPDAQYLFPKLLQKMADLRQLVTEHAQMMQRI 

421 KKTETETSLHPLLQEIYKDMY 



FIG. 1 0A 
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1 gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 

61 gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 

121 aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 

181 atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 

241 atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 

301 ctgggcctgc aggtgtggcg ccgaggggag atcagccatg gagcagccac aggaggaagc 

361 ccctgaggtc cgggaagagg aggagaaaga ggaagtggca gaggcagaag gagccccaga 

421 gctcaatggg ggaccacagc atgcacttcc ttccagcagc tacacagacc tctcccggag 

481 ctcctcgcca ccctcactgc tggaccaact gcagatgggc tgtgacgggg cctcatgcgg 

541 cagcctcaac atggagtgcc gggtgtgcgg ggacaaggca tcgggcttcc actacggtgt 

601 tcatgcatgt gaggggtgca agggcttctt ccgtcgtacg atccgcatga agctggagta 

661 cgagaagtgt gagcgcagct gcaagattca gaagaagaac cgcaacaagt gccagtactg 

721 ccgcttccag aagtgcctgg cactgggcat gtcacacaac gctatccgtt ttggtcggat 

781 gccggaggct gagaagagga agctggtggc agggctgact gcaaacgagg ggagccagta 

841 caacccacag gtggccgacc tgaaggcctt ctccaagcac atctacaatg cctacctgaa 

901 aaacttcaac atgaccaaaa agaaggcccg cagcatcctc accggcaaag ccagccacac 

961 ggcgcccttt gtgatccacg acatcgagac attgtggcag gcagagaagg ggctggtgtg 

1021 gaagcagttg gtgaatggcc tgcctcccta caaggagatc agcgtgcacg tcttctaccg 

1081 ctgccagtgc accacagtgg agaccgtgcg ggagctcact gagttcgcca agagcatccc 

1141 cagcttcagc agcctcttcc tcaacgacca ggttaccctt ctcaagtatg gcgtgcacga 

1201 ggccatcttc gccatgctgg cctctatcgt caacaaggac gggctgctgg tagccaacgg 

1261 cagtggcttt gtcacccgtg agttcctgcg cagcctccgc aaacccttca gtgatatcat 

1321 tgagcctaag tttgaatttg ctgtcaagtt caacgccctg gaacttgatg acagtgacct 

1381 ggccctattc attgcggcca tcattctgtg tggagaccgg ccaggcctca tgaacgttcc 

1441 acgggtggag gctatccagg acaccatcct gcgtgccctc gaattccacc tgcaggccaa 

1501 ccaccctgat gcccagtacc tcttccccaa gctgctgcag aagatggctg acctgcggca 

1561 actggtcacc gagcacgccc agatgatgca gcggatcaag aagaccgaaa ccpagacctc 

1621 gctgcaccct ctgctccagg agatctacaa ggacatgtac taacggcggc acccaggcct 

1681 ccctgcagac tccaatgggg ccagcactgg aggggcccac ccacatgact tttccattga 

1741 ccagctctct tcctgtcttt gttgtctccc tctttctcag ttcctctttc ttttctaatt 

1801 cctgttgctc tgtttcttcc tttctgtagg tttctctctt cccttctccc ttctcccttg 

1861 ccctcccttt ctctctccta tccccacgtc tgtcctcctt tcttattctg tgagatgttt 

1921 tgtattattt caccagcagc atagaacagg acctctgctt ttgcacacct tttccccagg 

1981 agcagaagag agtgggcctg ccctctgccc catcattgca cctgcaggct taggtcctca 

2041 cttctgtctc ctgtcttcag agcaaaagac ttgagccatc caaagaaaca ctaagctctc 

2101 tgggcctggg ttccagggaa ggctaagcat ggcctggact gactgcagcc ccctatagtc 

2161 atggggtccc tgctgcaaag gacagtggca gaccccggca gtagagccga gatgcctccc 

2221 caagactgtc attgcccctc cgatcgtgag gccacccact gacccaatga tcctctccag 

2281 cagcacacct cagccccact gacacccagt gtccttccat cttcacactg gtttgccagg 

2341 ccaatgttgc tgatggcccc tccagcacac acacataagc actgaaatca ctttacctgc 

2401 aggcaccatg cacctccctt ccctccctga ggcaggtgag aacccagaga gaggggcctg 

FIG.10B 
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2461 caggtgagca ggcagggctg ggccaggtct 
2521 tgggtcagcc cagcacctcg cccagtggga 
2581 ctgatgtcca tttgtcccaa tagctctact 
2641 tggccaccta gaagtctccc tgcacagcct 
2701 ccacaccgct ggtccctgcc ctcccctgct 
2761 gggccaaagc acagctgggc atgccatgtc 
2821 gggcaagggg ctggctggag tctcagagca 
2881 ggcttcctgg gtcctgcctg gtcctccctc 
2941 gaagtgccca gcccctgcct gacggnnnnn 
3001 cgctccccac ctacccagct gatgggggtt 
3061 ggacagctgc tgggacccac ctcccccctt 
3121 cccgggtctg gtgctgagga tacagctctt 
3181 atgtatattt ttgctaggag ccccagcttc 
3241 actgacgaaa ctttaaataa atgggaatta 
3301 c 
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ccggggaggc aggggtcctg caggtcctgg 
gcttcccggg ataaactgag cctgttcatt 
gccctcccct tcccctttac tcagcccagc 
ctagtgtccg gggaccttgt gggaccagtc 
cccaggttga ggtgcgctca cctcagagca 
tgagcggcgc agagccctcc aggcctgcag 
cagaggtagg agaactgggg ttcaagccca 
ccaaggagcc attctatgtg actctgggtg 
nngatcactc tctgctggca ggattcttcc 
ggggtgcttc tttcagccaa ggctatgaag 
ccccggccac atgccgcgtc cctgccccca 
ctcagtgtct gaacaatctc caaaattgaa 
ctgtgttttt aatataaata gtgtacacag 
aatatttaaa aaaaaaagcg gccgcgaatt 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
(i) APPLICANT: Merck & Co., Inc. 

(ii) TITLE OF INVENTION: ASSAYS FOR NUCLEAR RECEPTOR 

AGONISTS AND ANTAGONISTS USING FLUORESCENCE RESONANCE 
ENERGY TRANSFER 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000, 126 E. Lincoln Ave. 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07065-0900 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(vixi) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coppola, Joseph A 

(B) REGISTRATION NUMBER: 38,413 

(C) REFERENCE / DOCKET NUMBER: 20017 PCT 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 732-594-6734 
(B} TELEFAX: 732-594-4720 
(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



Met 


Ala 


Glu 


Asn 


Leu 


Leu 


Asp 


Gly 


Pro 


Pro 


Asn 


Pro 


Lys 


Arg 


Ala 


Lys 


1 








5 










10 










15 


Leu 


Ser 


Ser 


Pro 


Gly 


Phe 


Ser 


Ala 


Asn 


Asp 


Ser 


Thr 


Asp 


Phe 


Gly 


Ser 








20 










25 










30 




Leu 


Phe 


Asp 


Leu 


Glu 


Asn 


Asp 


Leu 


Pro 


Asp 


Glu 


Leu 


He 


Pro 


Asn Gly 






35 










40 










45 








Gly 


Glu 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser Gly 


Asn 


Leu 


Val 


Pro 


Asp 


Ala 


Ala 




50 










55 










60 








Ser 


Lys 


His 


Lys 


Gin 


Leu 


Ser 


Glu 


Leu 


Leu 


Arg 


Gly 


Gly 


Ser Gly 


Ser 


65 










70 










75 










80 


Ser 


He 


Asn 


Pro 


Gly 
85 


He 


Gly 


Asn 


Val 


Ser 
90 


Ala 


Ser 


Ser 


Pro 


Val 
95 


Gin 


Gin 


Gly 


Leu 


Gly 
100 


Gly 


Gin 


Ala 


Gin 


Gly 
105 


Gin 


Pro 


Asn 


Ser 


Ala 
110 


Asn 


Met 


Ala 


Ser 


Leu 
115 


Ser 


Ala 


Met 


Gly 


Lys 
120 


Ser 


Pro 


Leu 


Ser 


Gin 
125 


Gly 


Asp 


Ser 


Ser 


Ala 
130 


Pro 


Ser 


Leu 


Pro 


Lys 
135 


Gin 


Ala 


Ala 


Ser 


Thr 
140 


Ser 


Gly 


Pro 


Thr 


Pro 


Ala 


Ala 


Ser 


Gin 


Ala 


Leu 


Asn 


Pro 


Gin 


Ala 


Gin 


Lys 


Gin 


Val 


Gly 


145 










150 










155 








160 


Leu 


Ala 


Thr 


Ser 


Ser 


Pro 


Ala 


Thr 


Ser 


Gin 


Thr 


Gly 


Pro 


Gly 


He 


Cys 










165 










170 










175 


Met 


Asn 


Ala 


Asn 


Phe 


Asn 


Gin 


Thr 


His 


Pro Gly 


Leu 


Leu 


Asn 


Ser 


Asn 








180 










185 










190 






Ser 


Gly 


His 
195 


Ser 


Leu 


He 


Asn 


Gin 
200 


Ala 


Ser 


Gin 


Gly 


Gin 
205 


Ala 


Gin 


Val 


Met 


Asn 


Gly 


Ser 


Leu 


Gly 


Ala 


Ala Gly 


Arg Gly 


Arg 


Gly Ala Gly Met 




210 










215 










220 










Pro 


Tyr 


Pro 


Thr 


Pro 


Ala 


Met 


Gin 


Gly 


Ala 


Ser 


Ser 


Ser 


Val 


Leu 


Ala 


225 










230 










235 










240 


Glu 


Thr 


Leu 


Thr 


Gin 
245 


Val 


Ser 


Pro 


Gin 


Met 
250 


Thr 


Gly 


His 


Ala 


Gly 
255 


Leu 


Asn 


Thr 


Ala 


Gin 


Ala 


Gly 


Gly 


Met 


Ala 


Lys 


Met 


Gly 


He 


Thr 


Gly Asn 








260 










265 










270 






Thr 


Ser 


Pro 


Phe 


Gly 


Gin 


Pro 


Phe 


Ser 


Gin 


Ala 


Gly 


Gly Gin 


Pro 


Met 






275 










280 








285 








Gly 


Ala 
290 


Thr 


Gly 


Val 


Asn 


Pro 
295 


Gin 


Leu 


Ala 


Ser 


Lys 
300 


Gin 


Ser 


Met 


Val 


Asn 


Ser 


Leu 


Pro 


Thr 


Phe 


Pro 


Thr 


Asp 


He 


Lys 


Asn 


Thr 


Ser 


Val 


Thr 


305 










310 










315 










320 


Asn 


Val 


Pro 


Asn 


Met 
325 


Ser 


Gin 


Met 


Gin 


Thr 
330 


Ser 


Val 


Gly 


He 


Val 
335 


Pro 


Thr 


Gin 


Ala 


He 
340 


Ala 


Thr 


Gly 


Pro 


Thr 
345 


Ala 


Asp 


Pro 


Glu 


Lys 
350 


Arg 


Lys 


Leu 


He 


Gin 


Gin 


Gin 


Leu 


Val 


Leu 


Leu 


Leu 


His 


Ala 


His 


Lys 


Cys 


Gin 






355 










360 










365 






Arg 


Arg 
370 


Glu 


Gin 


Ala 


Asn 


Gly 
375 


Glu 


Val 


Arg 


Ala 


Cys 
380 


Ser 


Leu 


Pro 


His 


Cys 


Arg 


Thr 


Met 


Lys 


Asn 


Val 


Leu 


Asn 


His 


Met 


Thr 


His 


Cys 


Gin 


Ala 


385 










390 










395 








400 


Gly 


Lys 


Ala 


Cys 


Gin 
405 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CGAGCCCCGA CCCCCGTCCG GGCCCTCGCC GGCCGCGCCG CCCGTGCCCG GGGCTGTTTT 60 

CCCGAGCAGG TGAAAATGGC TGAGAACTTG CTGGACGGAC CGCCCAACCC CAAAAGAGCC 120 

AAACTCAGCT CGCCCGGTTT CTCGGCGAAT GACAGCACAG ATTTTGGATC ATTGTTTGAC 180 

TTGGAAAATG ATCTTCCTGA TGAGCTGATA CCCAATGGAG GAGAATTAGG CCTTTTAAAC 240 

AGTGGGAACC TTGTTC C AG A TGCTGCTTCC AAACATAAAC AACTGTCGGA GCTTCTACGA 300 

GGAGGCAGCG GCTCTAGTAT CAACCCAGGA ATAGGAAATG TGAGCGCCAG CAGCCCCGTG 360 

CAGCAGGGCC TGGGTGGCCA GGCTCAAGGG CAGCCGAACA GTGCTAACAT GGCCAGCCTC 420 

AGTGCCATGG GCAAGAGCCC TCTGAGCCAG GGAGATTCTT CAGCCCCCAG CCTGCCTAAA 480 

CAGGCAGCCA GCACCTCTGG GCCCACCCCC GCTGCCTCCC AAGC AC TGAA TCCGCAAGCA 540 

CAAAAGCAAG TGGGGCTGGC GACTAGCAGC CCTGCCACGT CACAGACTGG AC CTGGTATC 600 

TGCATGAATG CTAACTTTAA CCAGACCCAC CCAGGCCTCC TCAATAGTAA CTCTGGCCAT 660 

AGCTTAATTA ATCAGGCTTC ACAAGGGCAG GCGCAAGTCA TGAATGGATC TCTTGGGGCT 720 

GCTGGCAGAG GAAGGGGAGC TGGAATGCCG TACCCTACTC CAGCCATGCA GGGCGCCTCG 780 

AGCAGCGTGC TGGCTGAGAC CCTAACGCAG GTTTCCCCGC AAATGACTGG TCACGCGGGA 840 

CTGAACACCG CACAGGCAGG AGGCATGGCC AAGATGGGAA TAACTGGGAA CACAAGTCCA 900 

TTTGGACAGC CCTTTAGTCA AGCTGGAGGG CAGCCAATGG GAGCCACTGG AGTGAACCCC 960 

CAGTTAGCCA GCAAACAGAG CATGGTCAAC AGTTTGCCCA CCTTCCCTAC AGATATCAAG 1020 

AATACTTCAG TCACCAACGT GCCAAATATG TCTCAGATGC AAACATCAGT GGGAATTGTA 1080 

CCCACACAAG CAATTGCAAC AGGCCCCACT GCAGATCCTG AAAAACGCAA ACTGATACAG 1140 

CAGCAGCTGG TTCTACTGCT TCATGCTCAT AAGTGTCAGA GACGAGAGCA AGCAAACGGA 1200 

GAGGTTCGGG CCTGCTCGCT CCCGCATTGT CGAACCATGA AAAACGTTTT GAATCACATG 1260 

ACGCATTGTC AGGCTGGGAA AGCCTGCCAA 129 0 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu Ala 

15 10 15 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 

20 25 30 

Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser Ser Gly Ser 

35 40 45 

Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser Asp 
50 55 60 
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Gly 


Ser 


Val 


lie 


Thr 


Asp 


Thr 


Leu 


Ser 


Pro 


Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


65 










70 










75 










80 


Val 


Thr 


Tyr 


Pro 


Val 


Val 


Pro 


Gly 


Ser 


Val 


Asp 


Glu 


Ser 


Pro 


Ser 


Gly 










85 










90 










95 




Ala 


Leu 


Asn 


He 


Glu 


Cys 


Arg 


He 


Cys 


Gly 


Asp 


Lys 


Ala 


Ser 


Gly 


Tyr 








100 










105 










110 






His 


Tyr 


Gly Val 


His 


Ala 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly 


Phe 


Phe 


Arg 


Arg 






115 










120 










125 








Thr 


He 


Arg 


Leu 


Lys 


Leu 


Val 


Tyr 


Asp 


Lys 


Cys 


Asp 


Arg 


Ser 


Cys 


Lys 




130 










135 










140 










He 


Gin 


Lys 


Lys 


Asn 


Arg 


Asn 


Lys 


Cys 


Gin 


Tyr 


Cys 


Arg 


Phe 


His 


Lys 


145 










150 










155 










160 


Cys 


Leu 


Ser 


Val 


Gly 


Met 


Ser 


His 


Asn 


Ala 


He 


Arg 


Phe 


Gly 


Arg 


Met 










165 










170 










175 




Pro 


Arg 


Ser 


Glu 


Lys 


Ala 


Lys 


Leu 


Lys 


Ala 


Glu 


He 


Leu 


Thr 


Cys 


Glu 








180 










185 










190 






His 


Asp 


He 


Glu 


Asp 


Ser 


Glu 


Thr 


Ala 


Asp 


Leu 


Lys 


Ser 


Leu 


Ala 


Lys 






195 










200 










205 








Arg 


He 


Tyr Glu 


Ala 


Tyr 


Leu 


Lys 


Asn 


Phe 


Asn 


Met 


Asn 


Lys 


Val 


Lys 




210 










215 










220 










Ala 


Arg 


Val 


He 


Leu 


Ser 


Gly 


Lys 


Ala 


Ser 


Asn 


Asn 


Pro 


Pro 


Phe 


Val 


225 










230 










235 










240 


He 


His 


Asp Met 


Glu 


Thr 


Leu 


Cys 


Met 


Ala 


Glu 


Lys 


Thr 


Leu 


Val 


Ala 










245 










250 










255 




Lys 


Leu 


Val 


Ala 


Asn 


Gly 


He 


Gin 


Asn 


Lys 


Glu 


Val 


Glu 


Val 


Arg 


He 








260 










265 










270 






Phe 


His 


Cys 


Cys 


Gin 


Cys 


Thr 


Ser 


Val 


Glu 


Thr 


Val 


Thr 


Glu 


Leu 


Thr 






275 










280 










285 








Glu 


Phe 


Ala 


Lys 


Ala 


He 


Pro 


Ala 


Phe 


Ala 


Asn 


Leu 


Asp 


Leu 


Asn 


Asp 




290 










295 










300 








Gin 


Val 


Thr 


Leu 


Leu 


Lys 


Tyr 


Gly 


Val 


Tyr 


Glu 


Ala 


He 


Phe 


Ala 


Met 


305 










310 










315 










320 


Leu 


Ser 


Ser 


Val 


Met 


Asn 


Lys 


Asp 


Gly 


Met 


Leu 


Val 


Ala 


Tyr 


Gly 


Asn 










325 










330 










335 




Gly 


Phe 


He 


Thr 


Arg 


Glu 


Phe 


Leu 


Lys 


Ser 


Leu 


Arg 


Lys 


Pro 


Phe 


Cys 








340 










345 










350 






Asp 


He 


Met 


Glu 


Pro 


Lys 


Phe 


Asp 


Phe 


Ala 


Met 


Lys 


Phe 


Asn 


Ala 


Leu 






355 










360 










365 








Glu 


Leu 


Asp 


Asp 


Ser 


Asp 


He 


Ser 


Leu 


Phe 


Val 


Ala 


Ala 


He 


He 


Cys 




370 










375 










380 










Cys 


Gly 


Asp Arg 


Pro 


Gly 


Leu 


Leu 


Asn 


Val 


Gly 


His 


He 


Glu 


Lys 


Met 


385 










390 










395 










400 


Gin 


Glu 


Gly 


He 


Val 


His 


Val 


Leu 


Arg 


Leu 


His 


Leu 


Gin 


Ser 


Asn 


His 










405 










410 










415 




Pro 


Asp 


Asp 


lie 


Phe 


Leu 


Phe 


Pro 


Lys 


Leu 


Leu 


Gin 


Lys 


Met 


Ala 


Asp 








420 










425 










430 






Leu 


Arg 


Gin 


Leu 


Val 


Thr 


Glu 


His 


Ala 


Gin 


Leu 


Val 


Gin 


He 


He 


Lys 






435 










440 










445 






Lys 


Thr 


Glu 


Ser 


Asp 


Ala 


Ala 


Leu 


His 


Pro 


Leu 


Leu 


Gin 


Glu 


He 


Tyr 




450 










455 










460 








Arg 


Asp 


Met 


Tyr 



























465 



(2) INFORMATION FOR SEQ ID N0:4: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GGCCCAGGCT GAAGCTCAGG GCCCTGTCTG CTCTGTGGAC TCAACAGTTT GTGGCAAGAC 60 

AAGCTCAGAA CTGAGAAGCT GTCACCACAG TTCTGGAGGC TGGGAAGTTC AAGATCAAAG 120 

TGCCAGCAGA TTCAGTGTCA TGTGAGGACG TGCTTCCTGC TTCATAGATA AGAGTAGCTT 180 

GGAGCTCGGC GGCACAACCA GCACCATCTG GTCGCGATGG TGGACACGGA AAGCCCACTC 240 

TGCCCCCTCT CCCCACTCGA GGCCGGCGAT CTAGAGAGCC CGTTATCTGA AGAGTTCCTG 3 00 

CAAGAAATGG GAAACATCCA AGAGATTTCG CAATCCATCG GCGAGGATAG TTCTGGAAGC 360 

TTTGGCTTTA CGGAATACCA GTATTTAGGA AGCTGTCCTG GCTCAGATGG CTCGGTCATC 420 

ACGGACACGC TTTCACCAGC TTCGAGCCCC TCCTCGGTGA CTTATCCTGT GGTCCCCGGC 480 

AGCGTGGACG AGTCTCCCAG TGGAGCATTG AACATCGAAT GTAGAATCTG CGGGGACAAG 540 

GCCTCAGGCT ATCATTACGG AGTCCACGCG TGTGAAGGCT GCAAGGGCTT CTTTCGGCGA 600 

ACGATTCGAC TCAAGCTGGT GTATGACAAG TGCGACCGCA GCTGCAAGAT CCAGAAAAAG 660 

AACAGAAACA AATGCCAGTA TTGTCGATTT CACAAGTGCC TTTCTGTCGG GATGTCACAC 720 

AACGCGATTC GTTTTGGACG AATGCCAAGA TCTGAGAAAG CAAAACTGAA AGCAGAAATT 780 

CTTACCTGTG AACATGACAT AGAAGATTCT GAAACTGCAG ATCTCAAATC TCTGGCCAAG 840 

AGAATCTACG AGGCCTACTT GAAGAACTTC AACATGAACA AGGTCAAAGC CCGGGTCATC 900 

CTCTCAGGAA AGGCCAGTAA CAATCCACCT TTTGTCATAC ATGATATGGA GACACTGTGT 960 

ATGGCTGAGA AGACGCTGGT GGCCAAGCTG GTGGCCAATG GCATCCAGAA CAAGGAGGTG 1020 

GAGGTC CGC A TCTTTCACTG CTGCCAGTGC ACGTCAGTGG AGACCGTCAC GGAGCTCACG 1080 

GAATTCGCCA AGGCCATCCC AGCGTTCGCA AACTTGGACC TGAACGATCA AGTGACATTG 1140 

CTAAAATACG GAGTTTATGA GGCCATATTC GCCATGCTGT CTTCTGTGAT GAACAAAGAC 1200 

GGGATGCTGG TAGCGTATGG AAATGGGTTT ATAACTCGTG AATTCCTAAA AAGCCTAAGG 1260 

AAACCGTTCT GTGATATCAT GGAACCCAAG TTTGATTTTG CCATGAAGTT CAATGCACTG 1320 

GAACTGGATG ACAGTGATAT CTCCCTTTTT GTGGCTGCTA TCATTTGCTG TGGAGATCGT 13 80 

CCTGGCCTTC TAAACGTAGG ACACATTGAA AAAATGCAGG AGGGTATTGT ACATGTGCTC 1440 

AGACTCCACC TGCAGAGCAA CCACCCGGAC GATATCTTTC TCTTCCCAAA ACTTCTTCAA 1500 

AAAATGGCAG ACCTCCGGCA GCTGGTGACG GAGCATGCGC AGCTGGTGCA GATCATCAAG 1560 

AAGACGGAGT CGGATGCTGC GCTGCACCCG CTACTGCAGG AGATCTACAG GGACATGTAC 1620 

TGAGTTCCTT CAGATCAGCC ACACCTTTTC CAGGAGTTCT GAAGCTGACA GCACTACAAA 1680 

GGAGACGGGG GAGCAGCACG ATTTTGCACA AATATCCACC ACTTTAACCT TAGAGCTTGG 1740 

ACAGTCTGAG CTGTAGGTAA CCGGC AT ATT ATTCCATATC TTTGTTTTAA CCAGTACTTC 1800 

TAAGAGCATA GAACTCAAAT GCTGGGGGAG GTGGC TAATC TCAGGACTGG GAAG 1854 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



Met Thr Met Val Asp Thr Glu He Ala Phe Trp Pro Thr Asn Phe Gly 
15 10 15 
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lie 


Ser 


Ser 


Val 
20 


Asp 


Leu 


Ser 


Val 


Asp 


lie 


Lys 
35 


Pro 


Phe 


Thr 


Thr 


Val 
40 


His 


Tyr 
50 


Glu 


Asp 


lie 


Pro 


Phe 
55 


Thr 


Tyr 


Lys 


Tyr 


Asp 


Leu 


Lys 


Leu 


Gin 


65 










70 






Glu 


Pro 


Ala 


Ser 


Pro 
85 


Pro 


Tyr 


Tyr 


Lys 


Pro 


His 


Glu 
100 


Glu 


Pro 


Ser 


Asn 


Val 


Cys 


Gly 
115 


Asp 


Lys 


Ala 


Ser 


Gly 
120 


Glu 


Gly 
130 


Cys 


Lys 


Gly 


Phe 


Phe 
135 


Arg 


Tyr 


Asp 


Arg 


Cys 


Asp 


Leu 


Asn 


Cys 


145 










150 






Lys 


Cys 


Gin 


Tyr 


Cys 
165 


Arg 


Phe 


Gin 


His 


Asn 


Ala 


lie 
180 


Arg 


Phe 


Gly 


Arg 


Leu 


Leu 


Ala 
195 


Glu 


He 


Ser 


Ser 


Asp 
200 


Ala 


Asp 
210 


Leu 


Arg 


Gin 


Ala 


Leu 
215 


Ala 


Lys 


Ser 


Phe 


Pro 


Leu 


Thr 


Lys 


Ala 


225 










230 






Lys 


Thr 


Thr 


Asp 


Lys 
245 


Ser 


Pro 


Phe 


Met 


Met 


Gly 


Glu 
260 


Asp 


Lys 


He 


Lys 


Glu 


Gin 


Ser 
275 


Lys 


Glu 


Val 


Ala 


He 
280 


Arg 


Ser 
290 


Val 


Glu 


Ala 


Val 


Gin 
295 


Glu 


Pro 


Gly 


Phe 


Val 


Asn 


Leu 


Asp 


Leu 


305 










310 






Tyr 


Gly 


Val 


His 


Glu 
325 


He 


He 


Tyr 


Lys 


Asp 


Gly 


Val 
340 


Leu 


He 


Ser 


Glu 


Phe 


Leu 


Lys 
355 


Ser 


Leu 


Arg 


Lys 


Pro 
360 


Phe 


Glu 
370 


Phe 


Ala 


Val 


Lys 


Phe 
375 


Asn 


Leu 


Ala 


lie 


Phe 


He 


Ala 


Val 


lie 


385 










390 






Leu 


Leu 


Asn 


Val 


Lys 
405 


Pro 


He 


Glu 


Ala 


Leu 


Glu 


Leu 
420 


Gin 


Leu 


Lys 


Leu 


Phe 


Ala 


Lys 
435 


Leu 


Leu 


Gin 


Lys 


Met 
440 


Glu 


His 
450 


Val 


Gin 


Leu 


Leu 


Gin 
455 


Val 



Met 


Glu 


Asp 


His 


Ser 


His 


Ser 


Phe 


25 










30 






Asp 


Phe 


Ser 


Ser 


He 


Ser 


Thr 


Pro 










45 








Arg 


Thr 


Asp 


Pro 


Val 


Val 


Ala 


Asp 








60 










Glu 


Tyr 


Gin 


Ser 


Ala 


He 


Lys 


Val 






75 










80 


Ser 


Glu 


Lys 


Thr 


Gin 


Leu 


Tyr 


Asn 




90 










95 




Ser 


Leu 


Met 


Ala 


He 


Glu 


Cys 


Arg 


105 










110 






Phe 


His 


Tyr 


Gly 


Val 


His 


Ala 


Cys 










125 








Arg 


Thr 


He 


Arg 


Leu 


Lys 


Leu 


He 








140 










Arg 


He 


His 


Lys 


Lys 


Ser 


Arg 


Asn 






155 










160 


Lys 


Cys 


Leu 


Ala 


Val 


Gly 


Met 


Ser 




170 










175 




He 


Ala 


Gin 


Ala 


Glu 


Lys 


Glu 


Lys 


185 










190 






He 


Asp 


Gin 


Leu 


Asn 


Pro 


Glu 


Ser 










205 








Lys 


His 


Leu 


Tyr 


Asp 


Ser 


Tyr 


He 








220 










Lys 


Ala 


Arg 


Ala 


He 


Leu 


Thr 


Gly 






235 










240 


Val 


He 


Tyr 


Asp 


Met 


Asn 


Ser 


Leu 




250 










255 




Phe 


Lys 


His 


He 


Thr 


Pro 


Leu 


Gin 


265 










270 






Arg 


lie 


Phe 


Gin 


Gly 


Cys 


Gin 


Phe 










285 








He 


Thr 


Glu 


Tyr 


Ala 


Lys 


Ser 


He 








300 










Asn 


Asp 


Gin 


Val 


Thr 


Leu 


Leu 


Lys 






315 










320 


Thr 


Met 


Leu 


Ala 


Ser 


Leu 


Met 


Asn 




330 










335 




Gly 


Gin 


Gly 


Phe 


Met 


Thr 


Arg 


Glu 


345 










350 






Phe 


Gly 


Asp 


Phe 


Met 


Glu 


Pro 


Lys 










365 








Ala 


Leu 


Glu 


Leu 


Asp 


Asp 


Ser 


Asp 








380 










He 


Leu 


Ser 


Gly 


Asp 


Arg 


Pro 


Gly 






395 










400 


Asp 


lie 


Gin 


Asp 


Asn 


Leu 


Leu 


Gin 




410 










415 




Asn 


His 


Pro 


Glu 


Ser 


Ser 


Gin 


Leu 


425 










43 0 






Thr 


Asp 


Leu 


Arg 


Gin 


He 


Val 


Thr 










445 








He 


Lys 


Lys 


Thr 


Glu 


Thr 


Asp 


Met 



460 
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Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tyr 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CCGACCTTAC CCCAGGCGGC CTTGACGTTG GTCTTGTCGG CAGGAGACAG CACCATGGTG 60 

GGTTCTCTCT GAGTCTGGGA ATTCCCGAGC CCGAGCCGCA GCCGCCGCCT GGGGGGCTTG 12 0 

GGTCGGC CTC GAGGACACCG GAGAGGGGCG CCACGCCGCC GTGGCCGCAG AAATGACCAT 180 

GGTTGACACA GAGATCGCAT TCTGGCCCAC CAACTTTGGG ATCAGCTCCG TGGATCTCTC 240 

CGTAATGGAA GACCACTCCC ACTCCTTTGA TATCAAGCCC TTCACTACTG TTGACTTCTC 300 

CAGCATTTCT ACTCCACATT ACGAAGACAT TCCATTCACA AGAACAGATC CAGTGGTTGC 360 

AGATTACAAG TATGACCTGA AACTTCAAGA GTACCAAAGT GCAATCAAAG TGGAGCCTGC 420 

ATCTCCACCT TATTATTCTG AGAAGACTCA GCTCTACAAT AAGCCTCATG AAGAGCCTTC 480 

CAACTCCCTC ATGGCAATTG AATGTCGTGT CTGTGGAGAT AAAGCTTCTG GATTTCACTA 540 

TGGAGTTCAT GCTTGTGAAG GATGCAAGGG TTTCTTCCGG AGAACAATCA GATTGAAGCT 600 

TATCTATGAC AGATGTGATC TTAACTGTCG GATC C AC AAA AAAAGTAGAA ATAAATGTCA 660 

GTACTGTCGG TTTCAGAAAT GCCTTGCAGT GGGGATGTCT CATAATGCCA TCAGGTTTGG 72 0 

GCGGATCGCA CAGGCCGAGA AGGAGAAGCT GTTGGCGGAG ATCTCCAGTG ATATCGACCA 780 

GCTGAATCCA GAGTCCGCTG ACCTCCGTCA GGC CCTGGC A AAACATTTGT ATGACTCATA 840 

CATAAAGTCC TTCCCGCTGA CCAAAGCAAA GGCGAGGGCG ATCTTGACAG GAAAGACAAC 900 

AGACAAATCA CCATTCGTTA TCTATGACAT GAATTCCTTA ATGATGGGAG AAGATAAAAT 960 

CAAGTTCAAA CACATCACCC CCCTGCAGGA GCAGAGCAAA GAGGTGGCCA TCCGCATCTT 1020 

TCAGGGCTGC CAGTTTCGCT CCGTGGAGGC TGTGCAGGAG ATCACAGAGT ATGCCAAAAG 1080 

CATTCCTGGT TTTGTAAATC TTGACTTGAA CGACCAAGTA ACTCTCCTCA AATATGGAGT 114 0 

CCACGAGATC ATTTACACAA TGCTGGCCTC CTTGATGAAT AAAGATGGGG TTCTCATATC 1200 

CGAGGGCCAA GGCTTCATGA CAAGGGAGTT TCTAAAGAGC CTGCGAAAGC CTTTTGGTGA 12 60 

CTTTATGGAG CCCAAGTTTG AGTTTGCTGT GAAGTTCAAT GCACTGGAAT TAGATGACAG 1320 

CGACTTGGCA ATATTTATTG CTGTCATTAT TCTCAGTGGA GACCGCCCAG GTTTGCTGAA 1380 

TGTGAAGCCC ATTGAAGACA TTCAAGACAA CCTGCTACAA GCCCTGGAGC TCCAGCTGAA 1440 

GCTGAACCAC CCTGAGTCCT CACAGCTGTT TGCCAAGCTG CTCCAGAAAA TGACAGACCT 1500 

CAGACAGATT GTCACGGAAC ACGTGCAGCT ACTGCAGGTG ATCAAGAAGA CGGAGACAGA 1560 

CATGAGTCTT CACCCGCTCC TGCAGGAGAT CTACAAGGAC TTGTACTAGC AGAGAGTCCT 1620 

GAGCCACTGC CAACATTTCC CTTCTTCCAG TTGCACTATT CTGAGGGAAA ATCTGACCAT 1680 

AAGAAATTTA CTGTGAAAAA GCGTTTTAAA AAGAAAAGGG TTTAGAATAT GATCTATTTT 1740 

ATGCATATTG TTTATAAAGA CACATTTACA ATTTACTTTT AATATTAAAA ATTACCATAT 1800 

TATGAAATTG C 1811 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Glu Gin Pro Gin Glu Glu Ala Pro Glu Val Arg Glu Glu Glu Glu 

1 5 10 15 

Lys Glu Glu Val Ala Glu Ala Glu Gly Ala Pro Glu Leu Asn Gly Glv 

20 25 30 

Pro Gin His Ala Leu Pro Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser 

35 40 45 

Ser Ser Pro Pro Ser Leu Leu Asp Gin Leu Gin Met Gly Cys Asp Gly 

50 55 60 

Ala Ser Cys Gly Ser Leu Asn Met Glu Cys Arg Val Cys Gly Asp Lys 
65 70 75 80 

Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly 

85 90 95 

Phe Phe Arg Arg Thr He Arg Met Lys Leu Glu Tyr Glu Lys Cys Glu 

100 105 no 

Arg Ser Cys Lys He Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys 

115 120 125 

Arg Phe Gin Lys Cys Leu Ala Leu Gly Met Ser His Asn Ala He Arg 

130 135 140 

Phe Gly Arg Met Pro Glu Ala Glu Lys Arg Lys Leu Val Ala Gly Leu 
145 150 155 160 

Thr Ala Asn Glu Gly Ser Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys 

165 170 175 

Ala Phe Ser Lys His He Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met 

180 185 190 

Thr Lys Lys Lys Ala Arg Ser He Leu Thr Gly Lys Ala Ser His Thr 

195 200 205 

Ala Pro Phe Val He His Asp He Glu Thr Leu Trp Gin Ala Glu Lys 

210 215 220 

Gly Leu Val Trp Lys Gin Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu 
225 230 235 240 

He Ser Val His Val Phe Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr 

245 250 255 

Val Arg Glu Leu Thr Glu Phe Ala Lys Ser He Pro Ser Phe Ser Ser 

260 265 270 

Leu Phe Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly Val His Glu 

275 280 285 

Ala He Phe Ala Met Leu Ala Ser lie Val Asn Lys Asp Gly Leu Leu 

290 295 300 

Val Ala Asn Gly Ser Gly Phe Val Thr Arg Glu Phe Leu Arg Ser Leu 
305 310 315 320 

Arg Lys Pro Phe Ser Asp He He Glu Pro Lys Phe Glu Phe Ala Val 

325 330 335 

Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala Leu Phe He 

340 345 350 

Ala Ala He He Leu Cys Gly Asp Arg Pro Gly Leu Met Asn Val Pro 

355 360 365 

Arg Val Glu Ala He Gin Asp Thr He Leu Arg Ala Leu Glu Phe His 

370 375 380 

Leu Gin Ala Asn His Pro Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu 
385 390 395 400 

Gin Lys Met Ala Asp Leu Arg Gin Leu Val Thr Glu His Ala Gin Met 
405 410 415 
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Met 



Leu 



Gin Arg lie Lys Lys Thr Glu Thr 
420 425 
Gin Glu lie iyr Lys Asp Met Tyr 



Glu Thr Ser Leu His Pro Leu 
430 



435 440 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GAATTCTGCG GAGCCTGCGG GACGGCGGCG GGTTGGCCCG TAGGCAGCCG GGACAGTGTT 60 

GTACAGTGTT TTGGGCATGC ACGTGATACT CACACAGTGG CTTCTGCTCA CCAACAGATG 120 

AAGACAGATG CACCAACGAG GGTCTGGAAT GGTCTGGAGT GGTCTGGAAA GCAGGGTCAG 180 

ATACCCCTGG AAAACTGAAG CCCGTGGAGC AATGATCTCT ACAGGACTGC TTCAAGGCTG 240 

ATGGGAACCA CCCTGTAGAG GTCCATCTGC GTTCAGACCC AGACGATGCC AGAGCTATGA 300 

CTGGGCCTGC AGGTGTGGCG CCGAGGGGAG ATCAGCCATG GAGCAGCCAC AGGAGGAAGC 360 

CCCTGAGGTC CGGGAAGAGG AGGAGAAAGA GGAAGTGGCA GAGGCAGAAG GAGCCCCAGA 420 

GCTCAATGGG GGACCACAGC ATGCACTTCC TTCCAGCAGC TACACAGACC TCTCCCGGAG 480 

CTCCTCGCCA CCCTCACTGC TGGACCAACT GCAGATGGGC TGTGACGGGG CCTCATGCGG 540 

CAGCCTCAAC ATGGAGTGCC GGGTGTGCGG GGACAAGGCA TCGGGCTTCC ACTACGGTGT 600 

TCATGCATGT GAGGGGTGCA AGGGC TTCTT CCGTCGTACG ATCCGCATGA AGCTGGAGTA 660 

CGAGAAGTGT GAGCGCAGCT GCAAGATTCA GAAGAAGAAC CGCAACAAGT GCCAGTACTG 720 

CCGCTTCCAG AAGTGCCTGG C AC TGGGC AT GTCACACAAC GCTATCCGTT TTGGTCGGAT 780 

GCCGGAGGCT GAGAAGAGGA AGCTGGTGGC AGGGCTGACT GCAAACGAGG GGAGCCAGTA 840 

CAACCCACAG GTGGCCGACC TGAAGGCCTT CTCCAAGCAC ATCTACAATG CCTACCTGAA 900 

AAACTTCAAC ATGACCAAAA AGAAGGCCCG CAGCATCCTC ACCGGCAAAG- CCAGCCACAC 960 

GGCGCCCTTT GTGATCCACG ACATCGAGAC ATTGTGGCAG GCAGAGAAGG GGCTGGTGTG 1020 

GAAGCAGTTG GTGAATGGCC TGCCTCCCTA CAAGGAGATC AGCGTGCACG TCTTCTACCG 1080 

CTGCCAGTGC ACCACAGTGG AGACCGTGCG GGAGCTCACT GAGTTCGCCA AGAGCATCCC 1140 

CAGCTTCAGC AGCCTCTTCC TCAACGACCA GGTTAC CCTT CTCAAGTATG GCGTGCACGA 1200 

GGCCATCTTC GCCATGCTGG CCTCTATCGT CAACAAGGAC GGGCTGCTGG TAGCCAACGG 1260 

CAGTGGCTTT GTCACCCGTG AGTTCCTGCG CAGCCTCCGC AAACCCTTCA GTGATATCAT 1320 

TGAGCCTAAG TTTGAATTTG CTGTCAAGTT CAACGCCCTG GAACTTGATG ACAGTGACCT 1380 

GGCCCTATTC ATTGCGGCCA TCATTCTGTG TGGAGACCGG CCAGGCCTCA TGAACGTTCC 1440 

ACGGGTGGAG GCTATC C AGG ACACCATCCT GCGTGCCCTC GAATTCCACC TGCAGGCCAA 1500 

CCACCCTGAT GCCCAGTACC TCTTCCCCAA GCTGCTGCAG AAGATGGCTG ACCTGCGGCA 1560 

ACTGGTCACC GAGCACGCCC AGATGATGCA GCGGATCAAG AAGACCGAAA CCGAGACCTC 1620 

GCTGCACCCT CTGCTCCAGG AGATCTACAA GGACATGTAC TAACGGCGGC ACCCAGGCCT 1680 

CCCTGCAGAC TCCAATGGGG CCAGCACTGG AGGGGCCCAC CCACATGACT TTTCCATTGA 1740 

CCAGCTCTCT TCCTGTCTTT GTTGTCTCCC TCTTTCTCAG TTCCTCTTTC TTTTCTAATT 1800 

CCTGTTGCTC TGTTTCTTCC TTTCTGTAGG TTTCTCTCTT CCCTTCTCCC TTCTCCCTTG 1860 

CCCTCCCTTT CTCTCTCCTA TCCCCACGTC TGTCCTCCTT TCTTATTCTG TGAGATGTTT 1920 

TGTATTATTT CACCAGCAGC ATAGAACAGG ACCTCTGCTT TTGCACACCT TTTCCCCAGG 1980 

AGCAGAAGAG AGTGGGCCTG CCCTCTGCCC CATCATTGCA CCTGCAGGCT TAGGTCCTCA 2040 

CTTCTGTCTC CTGTCTTCAG AGCAAAAGAC TTGAGCCATC CAAAGAAACA CTAAGCTCTC 2100 

TGGGCCTGGG TTCCAGGGAA GGCTAAGCAT GGCCTGGACT GACTGCAGCC CCCTATAGTC 2160 

ATGGGGTCCC TGCTGCAAAG GACAGTGGCA GACCCCGGCA GTAGAGCCGA GATGCCTCCC 2220 

CAAGACTGTC ATTGCCCCTC CGATCGTGAG GCCACCCACT GACCCAATGA TCCTCTCCAG 2280 

CAGCACACCT CAGCCCCACT GACACCCAGT GTC CTTCC AT CTTCACACTG GTTTGCCAGG 2340 
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CCAATGTTGC TGATGGCCCC TCCAGCACAC ACACATAAGC ACTGAAATCA CTTTACCTGC 2400 

AGGCACCATG CACCTCCCTT CCCTCCCTGA GGCAGGTGAG AACCCAGAGA GAGGGGCCTG 2460 

CAGGTGAGCA GGCAGGGCTG GGCCAGGTCT CCGGGGAGGC AGGGGTCCTG CAGGTCCTGG 2520 

TGGGTCAGCC CAGCACCTCG CCCAGTGGGA GCTTCCCGGG ATAAACTGAG CCTGTTCATT 2580 

CTGATGTCCA TTTGTCCCAA TAGCTCTACT GCCCTCCCCT TCCCCTTTAC TCAGCCCAGC 2640 

TGGCCACCTA GAAGTCTCCC TGCACAGCCT CTAGTGTCCG GGGACCTTGT GGGACCAGTC 2700 

CCACACCGCT GGTCCCTGCC CTCCCCTGCT CCCAGGTTGA GGTGCGCTCA CCTCAGAGCA 2760 

GGGCCAAAGC ACAGCTGGGC ATGCCATGTC TGAGCGGCGC AGAGCCCTCC AGGCCTGCAG 2820 

GGGCAAGGGG CTGGCTGGAG TCTCAGAGCA CAGAGGTAGG AGAACTGGGG TTCAAGCCCA 2880 

GGCTTCCTGG GTCCTGCCTG GTCCTCCCTC CCAAGGAGCC ATTCTATGTG ACTCTGGGTG 2940 

GAAGTGCCCA GCCCCTGCCT GACGGNNNNN NNGATCACTC TCTGCTGGCA GGATTCTTCC 3000 

CGCTCCCCAC CTACCCAGCT GATGGGGGTT GGGGTGCTTC TTTCAGCCAA GGCTATGAAG 3 060 

GGACAGCTGC TGGGACC C AC CTCCCCCCTT CCCCGGCCAC ATGCCGCGTC CCTGCCCCCA 3120 

CCCGGGTCTG GTGCTGAGGA TACAGCTCTT CTCAGTGTCT GAACAATCTC CAAAATTGAA 3180 

ATGTATATTT TTGCTAGGAG CCCCAGCTTC CTGTGTTTTT AATATAAATA GTGT AC AC AG 3240 

ACTGACGAAA CTTTAAATAA ATGGGAATTA AATATTTAAA AAAAAAAGCG GCCGCGAATT 3300 

C 3301 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE : nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
ACTCGGATCC AAGCCATGGC TGAGAACTTG CTGGACGG 38 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CACAAAGCTT AGGCCATGTT AGCACTGTTC GG 32 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTCAGTCGAC TTATTGAATT CCACTAGCTG GAGATCC 37 
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